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Abstract 

We propose a modification, based on the RESTART (repetitive simulation trials after 
reaching thresholds) and DPR (dynamics probability redistribution) rare event simu- 
lation algorithms, of the standard diffusion Monte Carlo (DMC) algorithm. The new 
algorithm has a lower variance per workload, regardless of the regime considered. In 
particular, it makes it feasible to use DMC in situations where the "naive" generalisation 
of the standard algorithm would be impractical, due to an exponential explosion of its 
variance. 

We numerically demonstrate the effectiveness of the new algorithm on a standard 
rare event simulation problem (probability of an unlikely transition in a Lennard- Jones 
cluster), as well as a high-frequency data assimilation problem. We then provide a 
detailed heuristic explanation of why, in the case of rare event simulation, the new 
algorithm is expected to converge to a limiting process as the underlying stepsize goes 
to 0. This is shown rigorously in the simplest possible situation of a random walk, 
biased by a linear potential. The resulting limiting object, which we call the "Brownian 
fan", is a very natural new mathematical object of independent interest. 

Keywords: Diffusion Monte Carlo, quantum Monte Carlo, rare event simulation, sequential Monte 
Carlo, particle filtering, Brownian fan, branching process 
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1 Introduction 



Diffusion Monte Carlo (DMC) is a well established method developed within the 
Quantum Monte Carlo (QMC) community to compute the ground state energy (the 
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lowest eigenvalue) of the Hamiltonian operator 

HiIj = -A^ + Vip 

as well as averages with respect to the square of the corresponding eigenfunction 
[Kal62, GS71, And75, CA80, KLll]. It is based on the fact that the Feynman-Kac 
formula ^ 

V'(x,Q=E,(^/(St)exp(- V(B,)ds^^ , (1.1) 
connects the solution, tp{x, t), of the partial differential equation 

i>io,-) = f, (1.2) 

to expectations of a standard Brownian motion starting at x. For large times t, 
suitably normalised integrals against the solution of (1.2) approximate normahsed 
integrals against the ground state eigenfunction of H. 

As we will see in the next section, DMC is an extremely flexible tool with many 
potential applications going well beyond quantum Monte Carlo. We will introduce 
a generalised version of DMC capable of solving problems arising in a wide variety 
of application areas. In fact, sequential importance sampUng [dDG05, Liu02] is a 
particular application of DMC in this general framework. In that setting the goal is to 
compute normalised expectations of the form 

E(/(t/«)exp(-E,,<tmj/tj)) 
Eexp(-Et,<tV^(fc,2/tj) 

where the Vik, •) are functions encoding, for example, the likelihood of sequentially 
arriving observations (at times ti < t2 < < • • • ) given the state of some Markov 
process yt. A central component of a sequential importance sampling scheme is a 
so-called "resampling" step (see e.g. [dDG05, Liu02]) in which copies of a system are 
weighted by the ratio of two densities and then resampled to produce an unweighted 
set of samples. Some popular resampling algorithms are adaptations of the generalised 
version of DMC that we will present below in Algorithm 1 (e.g. residual resampling as 
described in [Liu02]). Our results suggest that sequential importance sampling schemes 
could be improved (in some cases dramatically) by building the resampling step from 
our modification of DMC in Algorithm 2. 

In addition to existing algorithms such as sequential Monte Carlo that can be 
naturally cast as DMC, one can imagine several unexplored uses of DMC. For example, 
we will show that the generalisation of DMC in Algorithm 1 below could potentially be 
used to compute approximations to quantities of the form 

E(^/(2/i)exp(-^ V(yt)dyt)^ , 

or 

E{f{yt)exp{-V(yt))) . (1.3) 

where yt is a diffusion. The second of these expectations is particularly interesting. We 
will see that, in the context of computing expectations of this form, DMC becomes a 
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rare event simulation technique (see e.g. [Buc04, AGIO] and the references therein), 
i.e. a tool for efficiently generating samples of extremely low probability events and 
computing their probabilities. Such tools can be used to compute the probability or 
frequency of dramatic fluctuations in a stochastic process. Those fluctuations might, for 
example, characterise a reaction in a chemical system or a failure event in an electronic 
device (see e.g. [FS96, Buc04]). 

Unfortunately, in many of the most interesting settings, the natural generalisation 
of DMC introduced in Algorithm 1 exhibits a dramatic instabiUty. We show that this 
instability can be corrected by a slight enlargement of the state space to include a 
parameter dictating which part of the state space a sample is allowed to visit. The 
resulting method is introduced in Algorithm 2 in Section 2 below. This modification is 
inspired by the RESTART and DPR rare event simulation algorithms [VAVA91, HT99, 
DDll]. The RESTART and DPR algorithms bias an underlying Markov process by 
splitting the trajectory of the process into multiple trajectories and then appropriately 
reweighing the resulting trajectories. The splitting occurs every time the process moves 
from one set to another in a predefined partition of the state space. The key feature of 
the algorithms that we borrow is a restriction placed on the trajectories resulting from 
a splitting event. The new trajectories are only allowed to explore a certain region of 
state space and are eliminated when they exit this region. Our modification of DMC is 
similar, except that our branching rule is not necessarily based on a fixed partition of 
state space. 

In Section 3, we demonstrate the stability and utility of the new method in two 
appUcations. In the first one, we use the method to compute the probability of very 
unlikely transitions in a small cluster of Lennard- Jones particles. In the second example 
we show that the method can be used to efficiently assimilate high frequency observa- 
tional data from a chaotic diffusion. In addition to these computational examples, we 
provide an in-depth mathematical study of the new scheme (Algorithm 2). We show 
that regardless of parameter regime and even underlying dynamics (i.e. need not be 
related to a diffusion) the estimator generated by the new algorithm has lower variance 
per workload than the DMC estimator. 

In addition, by focusing on a particular asymptotic regime (small time-discretisation 
parameter) we are also able to rigorously establish several dramatic results concerning 
the stability of Algorithm 2 in settings in which the straightforward generalisation of 
DMC is unstable. In Section 5, we take a closer look at the situation (1.3) used for 
rare event simulation. We provide a heuristic explanation why one would expect our 
algorithm to converge to a limiting process as the discretisation parameter converges to 
and we give a non-rigorous characterisation of the limiting object which we call the 
"Brownian fan." We also provide a detailed study of the fine properties of this limiting 
process in the simplest case when t/t is a random walk (rescaled so that it converges to 
Brownian motion) and the biasing potential V is linear. The Brownian fan is interesting 
in its own right from a mathematical perspective and does not seem to have been studied 
before, though it is very closely related to the "Virgin island model" introduced in 
[Hut09]. 

The final part of this article. Section 6, is then devoted to a rigorous proof of the 
convergence of the output of Algorithm 2 to the Brownian fan for any sufficiently light- 
tailed one-step distribution of the underlying random walk. This result is formulated in 
Theorem 6.11. The Brownian fan has the unusual property that, while the number of 
particles alive at any fixed time t is finite with moments of all orders, there is a dense 
set of exceptional times at which it is infinite, due to the fact that offspring are created 
at infinite rate. As a consequence, an immediate challenge in our study of the Brownian 
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fan is finding a state space in which it is is a well-behaved Markov process (i.e. Feller 
and with continuous sample paths). 

1.1 Notations 

For any space y, we will denote by M^(y) the space of finite positive measures on y, 
endowed with the topology of weak convergence, together with the convergence of total 
mass. Given a random variable X, we denote its law by 'D(X), except in some cases 
where we introduce dedicated notations. 

We will often use the notation o < 6 as a shorthand for the inequality a < Cb for 
some constant C. The dependence of C on other quantities will usually be clear from the 
context, and will be indicated when ambiguities may arise. We will also use the standard 
notations a A 6 = min{a, b}, aW b = max{a, 6}, and [aj = max{i € Z : i < a}. 
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2 The algorithm and a summary of our main results 

With the potential term V removed, the PDE (1.2) is simply the familiar Fokker-Planck 
(or forward Kolmogorov) equation and one can of course approximate (1.1) (with 
V = 0)hy 

1 

i=i 

where the w^^ are M independent realisations of a Brownian motion. The probabiUstic 
interpretation of the source term Vij^ in the PDE is that it is responsible for the killing 
and creation of sample trajectories of w. In practice one cannot exactly compute the 
integral appearing in the exponent in (1.1). Common practice (assuming that tk = ke) 
is to replace the integral by the approximation 

rt Lt/^J-i , 

where e > is a small time-discretisation parameter. DMC then approximates 

/ Lt/eJ-l X 

(/)t=E( /(«;*) exp(- Y 2^yiwt,+.) + V^^tJ)e)). (2.1) 

The version of DMC that we state below in Algorithm 1 is a slight generalisation of the 
standard algorithm and approximates 



/ L*/eJ-l 

(/)t = E[ /(2/t)exp(- Y XiVke, y(k+i)i 




(2.2) 
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where yt is any Markov process and x is any function of two variables. Because it will 
be the case in our examples as well as in our in-depth case study, we have assumed that 
the Markov process is sampled at equal time intervals of length s. For simplicity we 
will henceforth also assume that tAsa multiple of e. These assumptions are by no means 
necessary. The DMC algorithm proceeds as follows: 

Algorithm 1 Slightly generalised DMC 

1 . Begin with M copies Xq^ = xq and k = 0. 

2. At step k there are N^e samples a;^]. Evolve each of these e 
units of time under the underlying dynamics to generate N^^ 

values 

^(i+l)£ ~ e dx I Vke = xfj. 

3 . For each j = 1, . . . , N^s , let 
and set 

where the v!-^^ are independent W(0, 1) random veiriables . 

4 . For j = l,...,Ns, and for i = 1, . . . , N^^^ set 

5. Finally, set N^k+Vie = and list the N(^k+i)£ vectors 

i^(fe+l)£j a^ i^(fe+l)£Jj = l 

6. At time t, produce the estimate 

Here, the notation W(o, b) was used to refer to the law of a random variable that is 
uniformly distributed in the interval (a, b). Below we will refer to the x[^* as particles 
and to a collection of particles as an ensemble. Algorithm 1 results in an imbiased 
estimate of (2.2), i.e. 

E\ft = (/)* . 

For reasonable choices of x and / (e.g. supi,^^^^{l)ke < oo and (|/|)t < oo) the law 
of large numbers then impUes that 

hm ft = {f)t. 

M-j-oo 

We use the symbol for expectations under the rules of Algorithm 1 to distinguish 
them for expectations under the rules of ovu* modified algorithm (Algorithm 2 below) 
which will simply be denoted by E. 
Of course if we set 

Xix,y)=liVix) + Viy))e, (2.3) 
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then we are back to the quantum Monte Carlo setting. We might, however, choose 

Xix,y) = Vix)iy-x), (2.4) 

which, if yt approximates a diffusion (which we also denote by yt), formally corresponds 
to an approximation of 

=E(^/(j/t)exp(- V(y,)dy,)^ , 

or 

X(x,y) = Viy)-Vix), (2.5) 
corresponding to an approximation of 

(/)i = e^(-'')E(/(2/i)e-^(^*)). 

The generalisation of DMC resulting from various choices of x is not frivolous. In 
Section 3 we will see that, with % of form (2.5), one can design potentially very useful 
rare event and particle filtering schemes. Unfortunately, when yt is a diffusion or a 
discretisation of a diffusion, the resulting Algorithms behave extremely poorly in the 
small discretisation size Umit (e — > 0). The reason is simple: on a time interval e, a 
typical step of Brownian motion moves by 0(y/e). Therefore, unlike for (2.3), for both 
(2.4) and (2.5), the typical size of x(x, y) is of order y^. As a consequence, at each 
step, the probability that a particle either dies or spawns a child is itself of order y^. 
Without modification, as £ ^ 0, the fluctuations in the number of particles, -^'^[t/ej . wUl 
grow wildly and the process will die out before a time of order 1 with higher and higher 
probability, thus causing the variance of our estimator to explode. This phenomenon is 
already evident in the simple case of a Brownian motion, 

y(k+i)s = yke + ^^k+i , XiO) = , (2.6) 

with 

Xix, y) = y-x (2.7) 

(a choice consistent with either (2.4) or (2.5)). Here the ^j, are independent mean 
and variance 1 Gaussian random variables. We will analyse this example (with more 
general ^fc) in great depth in Sections 5 and 6. For the moment. Figure 2 demonstrates 
the dramatic failure of the straightforward generalisation of DMC in Algorithm 1 on this 
simple example. There we plot the logarithm of the second moment of the number of 
particles in the ensemble at time 1 (i.e. after l/e steps of the algorithm) versus — log e 
for several values of e. Clearly, E(A^^) is growing exponentially in 1/e. This instability 
can be removed by carrying out the branching steps (Steps 3-5) only every 0(1) units 
of time instead of every e units of time and accumulating weights for the particles in the 
intervening time intervals. However, this small e regime cleanly highlights a serious 
(and unnecessary) deficiency in DMC. In contrast to Algorithm 1, the method that we 
will describe in Algorithm 2 below, appears to be stable as s vanishes (a property that 
will be confirmed later in Proposition 6.2). 

Our modification of DMC described in Algorithm 2 below not only suppresses the 
small £ instability just described, but results in a more robust and efficient algorithm in 
any context. Inspired by the RESTART and DPR algorithms studied in [VAVA91, HT99, 
DDI 1] we append to each particle a "ticket" 9 limiting the region that the particle is 
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allowed to visit. Samples are eliminated when and only when they violate their ticket 
values. As a consequence, particles typically survive for a much longer time, and in 
many situations there is a well-defined limiting algorithm when £ — ^ 0. More precisely, 
the modified algorithm proceeds as follows: 

Algorithm 2 Ticketed DMC 

1. Begin with M copies x^^ = Xq . For each j = 1, . . . ,M choose an 
independent random variable ~ ZY(0, 1) . 

2. At step k there are N^^ samples (x^l,0^^^). Evolve each of the 
x]^l one step to generate Nke values 

^(k+l)e ~ e dx I Vke = xf^). 

3. For each j = 1, . . . , iVfee , let 



p(j) = (>-x(.x'■ii,x'■^^^^_^^) _ (2.8) 



If P(J> < e^^l then set 



If PO) > ef^ then let 
and set 

TVO) = LpO) + uO')J V 1 , (2.9) 
where u*^^' are independent W(0, 1) random variables. 
4. VoT j = l,...,Nke, if N^'^ > set 

^(fe+i)£ - ^(fc+i)£ ana 0(^k+i)e - p(j) 
and for i = 2, . . . , N'^'> 

= ilii), and ~ ZY((P«)-M). 
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5 . Finally set N(k+i)s = list the N(^k+i)e vectors 
l--^(fe+l)£j \-^(k+i)eij=i 

6. At time t produce the estimate 

i=i 

Remark 2.1 Both here and in Algorithm 1 , we could have replaced the random variable 
l^pO) _|_ yO)j by aj^y integer- valued random variable with mean P^^\ The choice given 
here is natural, since it is the one that minimises the variance. From a mathematical 
perspective, the analysis would have been slightly easier if we chose instead to use 
Poisson distributed random variables. 

We should first establish that this new algorithm "does the job" in the sense that if 
the steps in Algorithm 1 are replaced by those in Algorithm 2, the resulting ensemble of 
particles still produces an unbiased estimate of the quantity (/)t defined in (2.2). This 
is the subject of Theorem 4.1 below which estabhshes that, indeed, for any bounded test 
function / and any choice of x, 

^ft = {f)t ■ (2.10) 

We have aheady mentioned that a considerable portion of this paper is devoted 
to showing that our modification of DMC, Algorithm 2, does not suffer the small s 
instability observed in Algorithm 1 . In that asymptotic regime Algorithm 2 dramatically 
outperforms the straightforward generalisation of DMC in Algorithm 1. However, a 
surprising side-effect of our modification is that Algorithm 2 is superior to Algorithm 1 
in all contexts. Before describing our theoretical results in the small e limit, we will 
therefore compare Algorithms 1 and 2 independently of a specific regime (discretisation 
stepsize, choice of x, etc). 

The key tool in carrying out this comparison is Lemma 4.4 in Section 4. That lemma 
establishes that by simply randomising all tickets uniformly between and 1 at each 
step of Algorithm 2, one obtains a process that, in law, is identical to the one generated 
by Algorithm 1. With this observation in hand we are able to establish in Theorem 4.3 
that the estimator produced by Algorithm 2 always has lower variance than Algorithm 1 , 
i.e. we prove that 

var ft <yar' ft, (2.11) 

holds for any bounded test function /, any underlying Markov chain Y, and any choice 
of X- In expression (2. 1 1) we have again distinguished expectations under the rules of 
Algorithm 1 from all other expectations by a superscript 1. 

In comparing Algorithms 1 and 2, it is not enough to simply compare the variances 
of the corresponding estimators: one should also compare their respective costs. Since 
the dominant contribution to any cost difference in the two algorithms will come from 
differences in the number of times one must evolve a particle from one time step to the 
next we compare the expectations of workload 

fe=o 
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imder the two rules. However, by (2.10) with / = 1 we see that 

t/£-l fe-1 

E^Wt =EWt = M ^ Eexp(-^x(%.,yo+i)e)) , 

fe=0 j=0 

so that the expected cost of both algorithms is the same. In fact, the proof of Theorem 4.3 
can be modified slightly to show that the variance of the workload of Algorithm 2 is 
always lower than the variance of workload of Algorithm 1. These remarks leave httle 
room for ambiguity. Algorithm 2 is more efficient than Algorithm 1 in any setting. 

Let us return now to the behaviour of the process generated by Algorithm 2 in the 
£ — )■ limit in the special case dehneated by equations (2.6) and (2.7) (but allowing 
more generality in the variables). This topic is the subject of Sections 5 and 6. We 
begin in Section 5 by motivating the construction of the continuous-time limit of the 
process generated by Algorithm 2 in the special case in which F is an appropriately 
scaled random walk and xi^^ V) = V — x. The process that we construct there in 
Definition 5.8 is built recursively by successive realisations of a Poisson point process 
in a space of excursions of yt. We call this process the Brownian fan. Loosely speaking, 
the Brownian fan is a branching process obtained in the following way. Start with a 
(usually finite) number of initial particles on R that furthermore come each with a tag 
G R, such that the position x satisfies x > v. Each of these particles, which we 
call the ancestor particles, perform independent Brownian motions until they hit the 
barrier x = v, where they get killed. Each of these particles independently produces 
offspring according to the following mechanism. Denoting by £ the space of excursions 
on R, let M be the Poisson point process on R x f with intensity measure a rft Q for 
some a > 0, where Q is Ito's Brownian excursion measure [It672]. If {t, w) is one of 
the points of M and the corresponding ancestor particle is alive at time t and located 
at xt, then it gives raise to an offspring which performs the motion described by w, 
translated so that the origin of the excursion lies at (t, Xi). This mechanism is then 
repeated recursively for each new particle created in this way. 

The Brownian fan has a number of very nice properties. For example, as established 
by Theorem 5.13, the continuous-time analogue of the number of particles at time t, N-t, 
corresponding to the Brownian fan satisfies 

supEexp (At iVt) < 00 (2.12) 
t>o 

for some continuous decreasing function \t > 0. As established in Proposition 5.15, 
the bound in (2.12) imphes that the continuous-time analogue of the workload, 

Wt= f Nsds 



J 

Jo 



is very nearly a differentiable function of time, 

sup Imi — — — < 00. 

tKTh^o h\logh\ 

This is close to optimal, since there turns out to be a dense set of exceptional times at 
which there are infinitely many particles ahve (see the argument given at the end of 
Section 5.2.3 below). 

Finally, in Proposition 5.18 we estabUsh that the Brownian fan is a FeUer process in 
a suitable state space which, when combined with the continuity property estabUshed in 
Proposition 6.8 implies that the Brownian fan is a strong Markov process. 
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The usual method by which one attempts to characterise the continuous-time Hmit 
of a sequence of discrete-time processes involves studying the limit of the generators of 
the discrete-time processes. In our case, as described in Section 5.3.4 the discrete-time 
generators do not converge to the correct limit. Surprisingly (and rather confusingly), 
they do actually converge to the generator of a Brownian fan, but unfortunately with 
the wrong parameters! Nevertheless, without resort to generators, in Theorem 6.11 we 
are able to establish the convergence of the process generated by Algorithm 2 to the 
Brownian fan. En route to this result we estabUsh a number of important and extremely 
encouraging results about the behaviour of the process generated by Algorithm 2 for 
finite e. For example, in Proposition 6.2 we obtain that 

sup ^Ntf < oo , 

e<eo 

for any p > Q and eo > sufficiently small where here Nt refers to the number of 
particles generated in Algorithm 2 and not to the Brownian fan (for which we have the 
even stronger result in (2.12)). In Corollary 6.10 we prove a imiform (in e) form of 
continuity of the processes. 

The existence of a continuous-time limit along with the many other robust features of 
Algorithm 2 established in the results just mentioned strongly suggest that Algorithm 2 
has significant potential as a flexible and efficient computational tool. In the next section 
we provide numerical evidence to that effect. 

3 Two examples 

In the following subsections we apply our modified DMC algorithm to two simple 
problems. In Section 3.1 we consider the approximation of a small probabiUty related 
to the escape of a diffusion from a deep potential well. We will use Algorithm 2 to bias 
simulation of the diffusion so that an otherwise rare event (escape from the potential 
well) becomes common (and then reweight appropriately). We will see that extremely 
low probability events can be computed with reasonably low relative error. In Section 
3.2 we demonstrate the performance of Algorithm 2 in one of DMCs most common 
application contexts, particle filtering. We will reconstruct a hidden trajectory of a 
diffusion, given noisy but very frequent observations. Our comparison shows that the 
reconstruction produced by the modified method is dramatically more accurate than the 
reconstruction produced by straightforward generalisation of DMC. 

3.1 Rare event simulation 

In Quantum Monte Carlo appUcations, the function % is (for the most part) specified by 
the potential V as in (2.3). In other applications however, there may be more freedom in 
the choice of x to achieve some specific goal. As already mentioned earlier, the choice 
X{x, y) = V{y) — V{x) tums Algorithm 2 into an estimator of 

(/)* = e^(-«'E(/(yt)e-^(^'') . 

The design of a "good" choice of V will be discussed below. By redefining / we find 
that e~^(^o' e^/j is an unbiased estimate of E(/(t/t)), i.e. 

e-y(.o)E^e^(-"*V(x^^) = E/(j/,), 
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Figure 2: (Left) The initial condition to generate the results in Table 1 . (Right) A 
typical sample of the event y2 ^ B. 

where the samples x^p are generated as in Algorithm 2. This suggests choosing V to 
minimise the variance of e^/j. Intuitively this involves choosing V to be smaller in 
regions where / is significant. The branching step in Algorithm 2 will create more 
copies when V decreases, focusing computational resources in regions where V is 
small. 

As an example, consider the overdamped Langevin dynamics 

dyt ^ -^U{yt)dt + ^j2^dBt 

where y represents the positions of seven two-dimensional particles (i.e. y £ IR^'*) and 

U(x) = ^4(||a;, - x,\\-^^ - \\x, - x.-r^) . 
i<j 

In this formula Xi represents the position of the ith particle (i.e. Xi e M^). In various 
forms this Lennard-Jones cluster is a standard non-trivial test problem in rare event 
simulation (see [DBC98]). 

After discretising this process by the Euler scheme 

y(k+l)e = Vke - ^U(yke)e+ y^¥fe £,k+l 

with a stepsize of e = 10"^ we apply Algorithm 2 with 

1 ^ 



X(x, y) = V{y) - V{x), V(x) = ^ ram 

kl i>2 
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The properties of the Euler discretisation in the context of rare event simulation are 
discussed in [VEWb]. 
Our goal is to compute 

F,A{y2eB), B ^ {x : V(x)< 0.1} , 

where our initial configuration is given by xf- = (0, 0) and for j = 2, 3, . . . , 7, 

x^ =(cos-,sm-j 



(see Figure 2). Starting from initial configuration xq = x , the particle initially at the 
centre of the cluster (xi) will typically remain there for a length of time that increases 
exponentially in 7^^. We are roughly computing the probability that, in 2 units of time. 
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0.05 


0.85 


9.290x10" 


■8 


30.84 


1.002x10-" 


9.290x10" 


-8 


0.025 


0.8 


1.129x10" 


13 


204.8 


1.311x10-21 


1.129x10" 


-13 



Table 1 : Performance of Algorithm 2 on the Lennard- Jones cluster problem described 
in Section 3.1 at different temperatures (7). A measure of the efficiency improvement 
of Algorithm 2 over straightforward simulation is obtained by taking the ratio of the last 
two columns 



the particle at the centre of the cluster exchanges positions with one of the particles in 
the outer shell. This probability decreases exponentially in 7-^ as 7 — )• 0. In this case 

fix) = iBix) , 

where 1b is the indicator function of B. In our simulations, A is chosen so that the 
expected number of particles ending in B, E/2, is (very) roughly 1. The results for 
several values of the temperature 7 are displayed in Table 1. 

The workload referenced in Table 1 is the (scaled) expected total nimiber ofWU 
evaluations per sample, i.e. the expectation of 

2/e 
k=l 

Note that when y = 0, Algorithm 2 reduces to straightforward (brute force) simulation 
and the workload is exactly 2. Increasing A results in the creation of more particles. 
This has the competing effects of increasing the workload on the one hand but reducing 
the variance of the resulting estimator on the other. If we let denote the variance 
of one sample run of Algorithm 2 (i.e. var e"^*^ with M = 1) and a^^ the 

variance of the random variable /(j/2), then the number of samples required to compute 
an estimate with a statistical error of err is M = a"^ /err and M^f = a^^/err for 
Algorithm 2 and brute force respectively. Taking account of the expected cost (scaled 
by e) per sample of Algorithm 2 (reported in the "workload" column) and of brute 
force simulation (identically 2) we can obtain a comparison of the cost of Algorithm 2 
and brute force by comparing the brute force variance to the product of the variance 
of the estimate produced by Algorithm 2 and one-half the workload reported in the 
fourth column of the table. These quantities are reported in the last two columns of the 
table. By taking the ratio of the two columns we obtain a measure of the relative cost 
per degree of accuracy of the two methods. One can see that the speedup provided by 
Algorithm 2 becomes dramatic for smaller values of 7. The variance of the brute force 
estimator is computed from the values in the third colunrn by 

brute force variance = P^a {y2 € B){1 — P^a (^2 € B)). 

Only the estimates in the first two rows of Table 1 were compared against straightforward 
simulation. Those estimates agreed to the appropriate precision. 

The choice of V used in the above simulations is certainly far from optimal. Given 
the similarities in this rare event context between Algorithm 2 and the DPR algorithm, it 
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should be straightforward to adapt the results of Dean and Dupuis in | DD 1 1 ] identifying 
optimal choices of ^ in the small 7 Umit. In fact, we suspect (but do not prove) that the 
optimal choice of V at any temperature is given by the time-dependent function 

V(k, x) = - logP(y2 eB\yke= x). 

This choice is clearly impractical as it requires knowing the probability that we are 
trying to compute. Even asymptotic estimates based on taking an appropriate low 7 
limit of this choice are often not practical so it is worth noting that, even with a rather 
clumsy choice of V, om scheme is still able to produce impressive results to fairly low 
temperatures. We fully expect however that without a very carefully chosen V, the cost 
of achieving a fixed relative accuracy for our scheme will grow exponentially with 7"^ 
as 7 ^ 0. This characteristic is common in rare event simulation methods. 

3.2 Non-linear filtering 

hi this section our goal is to reconstruct a sample path of the solution to the stochastic 
differential equation 

dyl^^ = 10(2/Ji> - yf )) dt + V2dB[^^ , (3.1) 
dy?^ = {y[^\28 - ) - ') dt + ^/2dBf' , (3.2) 

dyr>^{y^W>~lyr>)dt + V^dBf>, (3.3) 

with 

/ -5.91652 \ 

yo = -5.52332 
\ 24.57231 / 

The deterministic version of this system is the famous Lorenz 63 chaotic ordinary 
differential equation [Lor63]. The stochastic version above is commonly used in tests 
of on-hne filtering strategies (see e.g. [MGG94, MTAC12]). 

The path of yt is revealed only through the 3-dimensional noisy observation process 

dht ^ytdt + 0.1 dBt (3.4) 

where the 3-dimensional Brownian motion Bf is independent of Bt. Our task is to 
reconstruct the path of yt given a realisation of /i*. More precisely, our goal is to sample 
from (and compute moments of) the conditional distribution of t/j given Tj' , where 
is the filtration generated by h. One can verify that expectations with respect to this 
conditional distribution can be written as 

E«(/(yt)exp(-10/o(y„d/i(s))-50/J||2;,||2(is)) 

— , (3 5) 

ESexp(-10/o*(t/«,d/i,) - 50j*\\y,\\^ds) 

where the superscript B on the expectations indicates that they are expectations over Bt 
only and not over Bt, i.e. the trajectory of ht is fixed. We will focus on estimating the 
mean of this distribution (fix) = x) which we will refer to as the reconstruction of the 
hidden signal. 

As always, we should first discretise the problem. The simplest choice is to again 
replace (3.1) by its Euler discretisation with parameter e. The observation process (3.4) 
can be replaced by 

h(k+l)e - hke ~yke£ + O.ly/e TJk+l (3.6) 
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Figure 3: Hidden signal (black), together with the observed signal in gray. 



where the 77^ are independent Gaussian random variables with mean and identity 
covariance. We again set e = 10"''. With this choice for ht, it is equivalent to assume 
that the observation process is the sequence of increments h(k+i)e — hke instead of H 
itself. To emphasise that we will be conditioning on the values of h(k+i)e — hke and not 
computing expectations with respect to these variables we will use the notation to 
denote a specific realisation of /i(fe+i)£ — hks- In Figure 3.2 we plot a trajectory of the 
resulting discrete process y and observations h(k+i)s — hks- 

Note that given a particular state yke = x, the probability of observing h(k+i)s — 
hke = A is 

and expectations of the discretised process y given the h(k+i)£ — hke = Afe observations 
can be computed by the formula 



/ 

■^fe ^EUfc£exp(^-^ 



0.02e 



(3.7) 



where the normalisation constant Zk is given by 



Zfc =Eexp(-^ 



0.02e 



■)■ 



In the small e limit, formula (3.7) (with k = t/e) indeed converges to (3.5) (see [Cril 1]). 
Expression (3.7) suggests applying Algorithms 1 or 2 with 



Xix, y) 



\y£ - Afc+i| 
0.02e 



at each time step. Below it will be convenient to consider the resulting choice of P'*', 
P® = exp I - 



0.02e 
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in Algorithm 2. With this choice, the expected number of particles at the kth step would 
be 



MEexp(^-^ 



0.02e 



■)• 



a quantity that my decay very rapidly with k. Since our goal is to compute conditional 
expectations we are free to normalise P*'^ by any constant. A tempting choice is 



exp 



0.02e 



which would result in ENke = M for all k. Unfortunately we do not know the 
conditional expectation in the denominator of this expression. However, for a large 
number of particles Nke we have the approximation 



1 ^ / 



This suggests using 



0.02£ 



E exp - 



|y(fc+i)££ - Afc+il 
0.02£ 



) I Ai,...,Afe) 



exp - 



11^; 



0.02s 



ErJl exp - 



0.02e 



(3.8) 



which will guaranty that ENke = M for all k. 

In fact, in practical applications it is important to have even more control over the 
population size. If we use P*^'' in formula (3.8) in our DMC algorithms, the resulting 
A^-process will be a Martingale bounded below by the absorbing state N = 0. As such 
it will eventually reach that absorbing state, an inconvenient predicament in practical 
implementations. Stronger control on the number of particles can be achieved in many 
ways (e.g. by resampling strategies [dDG05, Liu02]). We choose to make the simple 
modification 



p« = M- 



exp 



ll£'" 



-Afc 



0.02£ 



0.02s 



(3.9) 



which results in an expected number of particles at step fc + 1 of M independently of 
the details of the step k ensemble. Formula (3.9) depends on the entire ensemble of 
walkers and does not fit strictly within the confines of Algorithms 1 and 2. This choice 
will lead to estimators with a small, M-dependent, bias. All sequential Monte Carlo 
strategies with ensemble population control suffer a similar bias. 

The results of a test of Algorithm 2 with this choice of P^'^ are presented in Figure 5 
for M = 10. The true trajectory of y is drawn as a dotted line, while our reconstruction 
is drawn as a solid line. Note that the dotted line is nearly completely hidden behind 
the solid Une, indicating an accurate reconstruction. In Figure 4 we show the results 
of the same test with Algorithm 2 replaced by Algorithm 1. The resulting method is 
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Figure 4: Reconstruction of the three components of the signal using Algorithm 1. The 
hidden signal is shown as a dotted line. 




Figure 5: Reconstruction of the three components of the signal using Algorithm 2. The 
hidden signal is shown as a dotted line. 
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very similar to a standard particle filter with the conunon residual resampling step (see 
[Liu02]). 

With our small choice of e, one might expect that the number of particles generated 
by Algorithm 1 would explode or vanish almost instantly. Our choice of P*^*' prevents 
large fluctuations in N. We expect instead that the niunber of truly distinguishable 
particles in the ensemble (in the sense that they are not very nearly in the same positions) 
will drop to 1 instantly, leading to a very low resolution scheme and a poor reconstruction. 
This is supported by the results shown in Figure 4. 

The fact that we obtain such a good reconstruction in Figure 5 with only 10 particles 
indicates that this is not a particularly challenging filtering problem. Challenging filter- 
ing problems and more involved methods for dealing with them are discussed in [VEWa]. 
The purpose of this example is to demonstrate that by simply removing unnecessary 
variance from a particle filler (via Algorithm 2) one can improve performance. This 
improvement is illustrated dramatically in our small e setting. In practice it is advisable 
to carry out the copying and killing steps in Algorithm 2 less frequently, accumulating 
particle weights in the intervening time intervals. Nonetheless, any variance in the result- 
ing estimate will be unnecessary amplified if one uses a method based on Algorithm 1 
instead of Algorithm 2. 

4 Bias and comparison of Algorithms 1 and 2 

In this section we establish several general properties of Algorithms 1 and 2. In contrast 
to later sections we do not focus here on any specific choice of the Markov process y or 
the function %. For simplicity in our displays we will assume in this section that y is 
time-homogenous, i.e. that its transition probability distribution is independent of time. 
As above we use and E to denote expectations under the rules of Algorithms 1 and 2 
respectively. In the proof of each result we will assume that M = 1 in Algorithms 1 
and 2. The results follow for > 1 by the independence of the processes generated 
from each copy of the initial configuration. 

Our first main result in this section estabUshes that Algorithms 1 and 2 are unbiased 
in the following sense. 

Theorem 4.1 Algorithm 2 produces an unbiased estimator, namely 



The expression also holds with E replaced by W. 

The proof of Theorem 4.1 is similar to the proof of Theorem 3.3 in [DDll] and requires 
the following lemma. 

Lemma 4.2 For any bounded, non-negative function F onW^ x'Rwe have 




t/e-l 



(4.1) 




(4.2) 



The expression also holds with E replaced by E . 
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Proof. From the description of the algorithm and, in particular the fact that given xq, 
and x^P, the tickets are all independent of each other, we obtain 



^\^F{xf,ef) xo,x'-^^j=J Fixf\ue^^='°'^'^'^)l^ 



-x(xo,i*i*)>„) 



The first term on the right can be rewritten as 



J 

while the second is 



Combining terms we obtain 



(x(xo,4")<0) 
t/ 

/■I 
Jo 



which establishes (4.2). A similar (but simpler) argument proves the result for E replaced 
byE^ □ 

With Lenoma 4.2 in hand we are ready to prove Theorem 4. 1 . 
Proof of Theorem 4.1. First notice that by Lenmia (4.2), we have the identity 

E ^ Fixi^\ ef) = E J^e-'^*^"'^-^ ^ F(ye, u) du^ , (4.3) 

for any function F(x, 9). In particular, the required identity holds for A; = 1 by setting 

F{x,e) = f{x). 

Now we proceed by induction, assuming that expression (4.1) holds for step fc — 1 
and prove that the relation also holds at A;. Notice that, by the Markov property of the 
process, 

e5:/(x<^])=E^E^o>,,o, E • 

Setting 

N(k-i)s 

Fix,e) = E,,e E /(4-i)e) 
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in expression (4.3), we obtain 

Nke 



(fc-l)£ 

According to the rule for generating the initial ticket we have that 



xfLn.) )du]. 



/-"'(fc-i). 

Jo V ,=1 



and we have therefore shown that 

We can conclude, by our induction hypothesis, that 

Nk. 



= E( /(2/t)exp(- ^ Xiyke,y<.k+l)e))] , 



fe=0 

and the proof is complete. A similar argument proves the result for E replaced by E^. 

□ 

The second main result of this section compares the variance of the estimators 
generated by Algorithms 1 and 2. We have that 

Theorem 4.3 For all bounded functions f, one has the inequality 

var,/i < var^/t . 

The inequality is strict when f is strictly positive. 

The following straightforward observation is an important tool in comparing Algo- 
rithms 1 and 2. 

Lemma 4.4 After replacing the rules 

f)(j) 

0[ii\,e = j^, and e^%^r.U{{P^Y\l) fori>l, 
in Step 4 of Algorithm 2 by 

^E'+i)e~"(0,l) for alii, (4.4) 

the process generated by Algorithm 2 becomes identical in law to the process generated 
by Algorithm 1. 
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Remark 4.5 Note that the resampUng of the tickets in (4.4) applies to all particles, 

including 2 = 1. 

As with the proof of Theorem 4.1, the proof of Theorem 4.3 is inductive. The most 
cumbersome component of that argument is obtaining a comparison of cross terms that 
appear when expanding the variance of the estimators produced by Algorithms 1 and 2. 
This is encapsulated in the following lemma. 

Lemma 4.6 Let F be any non-negative bounded function ofW^ x R, which is decreasing 
in its second argument. Then 

and the bound is strict when, for each x, F{x, 6) is a strictly decreasing function of 6. 

Proof. First note that x^p = ifp for each j. Furthermore, conditioned on xq, if-p, 
and A^£, the tickets are independent of each other and for j > 2 they are identically 
distributed (for Algorithm 1 they are identically distributed for j > 1). These facts 
imply that, for the new scheme. 



xP = 



^(j:T.F^^e\e)Fixp,op) 

E(21(jv,>i)(^s - l)\iPMF(xP,eP)\xP)E{F(£P,eP)\X 



2 



+ E(l(Ar^>i)(7V, - 1)(N, - 2) I iP)E{F(xP, 9f) \ x«) . (4.5) 
For Algorithm 1 the tickets 9^ are i.i.d. conditioned on x^ so that 

■AT, AT, 



EMEEJ^(4^^^^^V(4^^a 4" = 

E} {N,(N, - 1) I J«)Ei {FixP, eP) I xP)\ (4.6) 

Let 

Note that on the set {P < 1}, we have < 1, so that expressions (4.5) and (4.6) 
vanish. On the set {P > 1}, we have 

E(F(i«, ef^) I xP) = FixP, u) du , 

E {F{xP , eP) \xP)=P / FHP ,u)du, 
Jo 



and 



so that 



E^ {FixP, eP) I xP) = I FixP, u) du , 
Jo 



E^ (F(i«, ep) I xP) = UimP, eP) I if) + ^-^E{F(ip, ef) \ xP) 
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In other words, if we set 

A = E(F(4i>,^f)|4i)) 

and 



^(1) 

we have the identity 



In Algorithm 2, 6'^^^ < 6'*.^^ so that, if F is a decreasing function of Q, then 

B > 0. 

Now note that the variable is the same for both algorithms and is given expUcitly 
by the formula 

= l(eW<e-x(«o.4")<l) + l(x(xo,x«)<0) 

-X(a;o,£L^') 111 1 
J +\[/<e-x(^o,4")-[e-x(xo.4")J) j ' 

where ?7 is a uniform random variable in [0, 1]. Using this formula we can compute the 
expectations involving exphcitly. Defining 

R = P-\_P\, 

on the event {P > 1} we have that 

{N^{N^ - 1) I i«) = (P - R){P - 1 + i?) , 

E(l(;V,>l)(A^e-l)|4'^) =P-1, 

and 

E(l(;v,>i)(A^e - l)(iV. - 2) I i«) = (P - 1 - P)(P - 2 + P). 

The difference of terms (4.6) and (4.5), which vanishes on P < 2, can now be 
written as 

2(P - V)AB - 2(P - P)(P - 1 + R)Ab\^ - (P - P)(P - 1 + P)P^ ^ 

P P^ 

on the set {P > 2}. Recalling that 5 > (we can drop the last term) and rearranging 
we see that this expression is bounded above by 

AB 

2^{PiP-i)-iP-R)iP-i + R)). (4.7) 

Note that since < P < 1 we have P - 1 <P-1 + P<P-P<P. The difference in 
the parenthesis in (4.7) is the difference in the area of two squares with equal perimeter, 
the second of which (in the sense of the last sequence of inequalities) closer to square. 
The difference is therefore non-positive. All bounds are strict whenever, for each x, 
F(x, 9) is a strictly decreasing function of ^. □ 

Finally we complete the proof that the variance of the estimator generated by 
Algorithm 2 is bounded above by the variance of the estimator generated by Algorithm 1 . 
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Proof of Theorem 4.3. By Theorem 4.1 we have that 

E7t=E7t , 

so it suffices to show that 

E(/t)2 < ^\ftf . (4.8) 

Furthermore, since the variance does not change by adding a constant to / we can, and 
will from now on, assume that f{x) > for every x. In order to prove (4.8), we will 
proceed by induction. Since the random variables and i*.^' are the same in both 
schemes, the result is true for k = 1. Now assume that (4.8) holds through step A; — 1. 
We will show that 

j=i j=i 

Observe that we can write 

E/(-^) = EE/(<^^)' 

j=l j=l i=l 

where we used N^l^ to denote the number of particles at time ke whose ancestor at 

time e is x^J\ The descendants of x^/^ are enumerated by x^^'l^^ for i ~ 1,2, ... , N^J\g. 

By the conditional independence of the descendants of the particles at time ke when 
conditioned on xq, x^p, and N^, we have that 

E E/(^5^e) =E EE.«..<4 E 
Vj=l / Vj = l V i=l 



^(fc-l)e' 



EEe4-«- E /(^ 

/N(k-l)e s 

xE.<-.,.« E )• (4-10) 



An identical expression (with E replaced by E^) holds for Algorithm 1. Setting 

1=1 

and 



E /(4-l)e) ' 
i=l ^ 



we can rewrite the last identity in the case of Algorithm 2 as 

/Nke \2 JVe Ne Ne 

e( E /(^fce) ) = E E ^2(a=^\ ^e^'^) + E E E ^e^'m(4^ ^i'')- 

S=l ^ j=l 3 = 1 iT^j 
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In the case of Algorithm 1 with 



and 



we have 



/Affc. N 2 Af. We N, 

Note that Lemma 4.2 impUes that 

i=i i=i 

Under the rules of Algorithm 1, conditioned on xq and x'^P, the ticket 0^ is independent 
of A^e. So we can write the last equality as 

/ \\ 
E ^ i^2(4'\ = EM ^ E,« . 

i=i S=i ^ 

By our inductive hypothesis we then have that 

E f; F,{xf, of) < E^ E^o-, [P^ixf, ef)) ) 

Appealing again to the conditional independence of 6^^ and under Algorithm 1 we 
have that 

E F2{xf, 9f) < El ^ F^{x^,\ O'p) . 

We now move on to the second term in (4.10). It follows from the definition of 
Algorithm 2 that the function Fx is strictly decreasing in 9, so that Lemma 4.6 yields 

.TV, AT, . , N, 

i^j ^ ^j=i i^j 

Under the rules of Algorithm 1, conditional on xq and x'^\ the 9^ are all independent 
of each other and of A'^. Therefore we can integrate over the tickets to obtain 



^ (E E (^1(4^'' (^1(4^'' 



\7 = 1 
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By Theorem 4.1 we have that 



or 



1=1 1=1 



This yields 



< E ' 



' (E E e^o) ^«'))e^.> [p^ixf, ef)^ ) . 



Reinserting the tickets we obtain 

e(E E ^1(4^'' ^e^')^i(4^ ) < EM E E ^^^')^i°(4^ 0^:^)) , 

^j=l i^j ^ S=l i¥=j ^ 

which completes the proof. □ 
5 The continuous-time limit and the Brownian fan 

From now on, we restrict ourselves to the analysis of the case x(a^, V) = V(y) — V(x) 
for some "potential" V defined on the state space of the underlying Markov process. We 
argue that if the underlying process is obtained by approximating a diffusion process 
then, unlike in the case of Algorithm 1, our modification. Algorithm 2, converges to a 
non-trivial limiting process as the stepsize e converges to 0. 

We will first provide a heuristic argument showing what kind of limiting process 
one would expect to obtain. The remainder of this article will then be devoted to 
rigorously constructing the limiting process and proving convergence in the simple case 
in which the underlying Markov chain is a random walk (rescaled so that it converges 
to a standard Brownian motion) and the potential V is linear. In this case the limiting 
process is a very natural object that does not seem to have been studied in the literature 
so far. We call this object, which is closely related to the construction in [Hut09], the 
Brownian fan (see Section 5.2). It also has a flavour very similar to the construction 
of the Brownian web [FINR04] and the Brownian net [SS08], although there does not 
seem to be an obvious transformation linking these objects. 

5.1 Heuristic derivation of the continuous-time limit 

Throughout this section the imderlying Markov chain will be given by the following 
approximation to a diffusion: 

y(k+l)e = Vke + sFiyke) + ^/eT^iVke) 6+1 , Vke S R" , (5.1) 

where the ^ are a sequence of i.i.d. (not necessarily Gaussian) random variables with 
law V and the identity on R" as their covariance matrix. The functions F and S are 
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sufficiently "nice" functions, but since this section is only heuristic, we do not state 
specific regularity, growth or non-degeneracy assumptions. 

Concerning the function x> we take xi^^y) = Viv) — V{x) for some regular 
potential F : R" — >^ R. Recall now that, as long as a particle is alive, its ticket 6 evolves 
under Algorithm 2 as 

It is natural therefore to replace 9 by the quantity v given by 

In this way, the new "tag" v does not change over time, but is assigned to a particle at the 
moment of its birth. Translating Steps 3 and 4 of the algorithm into this slightly different 
setting, we see that if a particle performs a step from x to y such that V{y) < V{x), 
then it can potentially spawn one or more descendants. The tags v of the descendants 
are then distributed according to 

e-" !^W(e-^<^),e-^<2'>) , (5.2) 

and a particle with tag v Uves as long as it stays within the region {x : Vix) < v}. 



5.1.1 Description of the limit 

For very small values of e, the process described above has the following features. 
Taking the limit e ^ in (5.1), we observe that each particle follows a diffusion 
process, solving the equation 

dyt = Fiyt)dt + Eiyt)dBt, (5.3) 

where Bf is a standard rf-dimensional Brownian motion. If the particle has tag v, then 
this process is killed as soon as it exits the sublevel set {x : Vix) < v}. 

Consider the following representation of the object produced by Algorithm 2. Denote 
by Q| ^ the law of the e-discretization of (5.3) generated by (5.1) starting at x and killed 

upon exiting the set {y : V(y) < v}. Let r and {w/tej^ffo respectively, the lifetime 
and trajectory of the original particle. The trajectories of the offspring of this initial 
trajectory are very nearly given (it will be true in the small e limit) by a realisation fj,^'^ 
of a Poisson point process with intensity 

r/e-l 

Q^w, ■) = eJ2 ^'C'^) / Q^.„v(«,.,)+5 Vik, dS) , (5.4) 
where, according to the rule for generating new offspring in Algorithm 2, 



ifV(w^k+l)e)>V(Wke) 



and, according to the rule for generating offspring tickets in Algorithm 2, 



j fi5)r]ik,d5) = /(- log(l u(e-'^<"'<'=+"=^-^<'"'==» - 1))) du. 
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Here is the map that shifts trajectories forward by time t. Since each offspring 
behaves independently just like the original particle, this suggests that the nth generation 
/i^'" of offspring is obtained recvffsively as a realisation of the Poisson point process 
with intensity given by 

At each "microscopic" step, the probability of creating a descendent is of order y/e 
so that, in the Umit £ — >^ 0, each particle spawns descendants at infinite rate. However, 
any such descendant is created at distance 0{\/e) of the "barrier" Vix) = v. As a 
consequence, the probability that it survives for a time of order 1 before being killed is 
itself only of order y/e. Therefore, the rate at which a particle creates descendants that 
actually survive for a time r of order 1 is finite, but tends to infinity as r — >^ 0. 

Now we will consider the small e-limit of the object we have constructed. The 
trajectory wt becomes a sample path of (5.3) exiting the set {y : Viy) < v} at time r. 
Denote by Qx,v the law of the diffusion (5.3) starting at x and killed upon exiting the 
set {y : V(y) < v}, which is a probability measure on some space of excursions in 
R". The characterisation of the standard Ito excursion measure (see for example [RY91, 
Theorem 4.1] and [PY82]) then suggests that, for every a; e R" such that W ^ and 
S is non-degenerate, the limit 

Qx = Um -:Qx,v(x)+s , 

exists as a cr-finite measure in the sense that -^Qx.Vixj+s restricted to the set of excursions 
longer than a fixed length converges weakly to Q^^ restricted to the same set. 

The discussion so far suggests that for the limiting object, the trajectories of the first 
generation of offspring are given by a realisation /x^ of the Poisson point process with 
intensity measure 

Q(w,-)= I A(wt)etQmdt, (5.5) 
Jo 

for some intensity A : R" R+ yet to be determined and that the nth generation /i" of 
offspring is obtained recursively as a realisation of a Poisson point process with intensity 
given by 

e?"(-) = J Qiw,-)n''-\dw), 

with Q as in (5.5). In order to fully characterise the limiting object, it remains to provide 
an expression for the intensity function A. 

Let us start by replacing Q| ^ in equation (5.4) by Qx,v, i.e. by assuming that 
for small e, excvu^sions of the discrete process are very similar to excursions of its 
continuous time limit. We then apply the relations 

Qx,v<.x)+s ~ SQx (5.6) 

and 

ViW(k+l)s) - ViWke) ~ \/£(Vy(Wfee),S(Wfee)Cfe+l) 
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with the ^fe+i as in (5.1). We then formally obtain 

r/e-l 

l{VV(x),E(x)ifc+i><o(VV(Wfe£),S(t«fe£)^fe+l) 

fe=0 

X / u(Vy(w;fee),S(Wfee)Cfe+l)c^W©LQt"fee- 

Jo 

Our arguments so far therefore suggest that 

Aix) =11 (VF(x), nx)zf u(dz) , (5.7) 

^ J{vv(x),i:ix)z)<o 

where the distribution v has mean and identity covariance matrix. Assuming that v is 
synmietric this becomes 

Aix) = c(VF(a;), Y,ix)T,'^ix)WVix)) , (5.8) 

where c = j . If is not symmetric, one might even expect a prefactor c that depends 
on X. 

In fact, as we will see in a specific case in the remainder of this section, the correct 
value is c = |, whether the law of ^ is symmetric or not. The reason for this discrepancy 
is that the relation 

SQ 

used in our derivation is only valid if 5 ^ y/e. In our case however, one precisely has 
6 ~ ^/e, which introduces a correction factor that eventually gives rise to the value 
c = I . The aim of the next subsection is to show in more detail how this factor ^ arises 
in the simplest situation where F = and S = 1. 

5.1.2 The case of Brownian motion 

We now consider the one-dimensional case, where the hmiting underlying process is 
simple Brownian motion. Regarding the underlying discrete problem, we consider the 
Markov chain defined recursively by 

y(k+l)e = Vke + , (5.9) 

for an i.i.d. sequence of centred random variables with law and variance 1. For the 
potential function V, we choose V{x) = —ax for some a > 0. 

In order to show that the constant c appearing in (5.8) is equal to |, we will now 
argue that if we denote by Q the standard Ito excursion measure (which we normalise in 
such a way that Q = hm£_>.o jQe, where is the law of a standard Brownian motion 
starting at e and killed when it hits the origin) and by Q| the law of the random walk 
(5.9) starting at y/ez and stopped as soon as it takes negative values, then there exists a 
function G such that 

Ql^VeGiz)Q, (5.10) 

as e — )^ when both sides are restricted to excursions that survive for at least some fixed 
amount of time. We will see that the function G behaves like G(z) « z for large values 
of z, but has a non-trivial behaviour for values of order 1. In terms of our notation from 
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the previous subsection (since yt is spatially homogeneous and V is linear) this implies 
that the approximation (5.6) should really have been replaced by 

Since we assumed a > 0, our process creates offspring only when it performs a step 
towards the right, i.e. when ^fc+i > 0. The probability that a new particle is created in 
the fc-th step is approximately aS^k+i- Furthermore, the small e rule for the generation 
of tags implied by Algorithm 2 is 

_A^^ar^v + x^ (5.11) 

As a consequence, once we have identified the function G in (5.10), our arguments in 
the previous section lead to the formula 

{az) j aG{y) dy^ v{dz) 

If we can show that 

G{y)dyu{dz)=^, (5.12) 

a formula consistent with a choice of c = ^ in (5.8). 

In order to identify G, we note that if a random walk starting from ^/e6 survives for 
some time of order 1 before becoming negative then, with overwhelming probability, 
it will have reached a height of at least e-^^^ (say). Furthermore, if we condition the 
random walk Q| to reach a level ^/e-y with 1 <C 7 <C one would expect its law 

to be well approximated by y/s'jQ when restricted to excursions that survive for a time 
of order 1. 

As a consequence, we expect that 

Ql « P^,yV^-fQ , 7 » 1 , 

where Pz^-y denotes the probabihty that the simple random walk (5.9) with e = 1 started 
at z reaches the level 7 before becoming negative. 

The remainder of this section is devoted to the proof of the fact that if we define 
Pz^-y in this way, then under some integrability assumptions for the one-step probability 
v, the limit 

G{z) = Um jPz,j , 

7— >oo 

exists and does indeed satisfy (5.12), independently of the choice of v. Actually, we 
will prove these statements for the quantity P^.^ = Pz^-y+z, which we interpret as the 
probability that the random walk starting at the origin reaches [7, 00) before reaching 
(—00, —z]. Our first result is as follows: 



A{x)= [ 
Jo 

where is the law of the steps ^fe. 







then we will have 
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Proposition 5.1 Assume that the law v satisfies i'(,{\x\ > K}) < C exp(—cK^) for all 
K > and some strictly positive constants c, C and /3. Then, the limit 



G(s) = lim jP^^ (5.13) 

7— >oo ' 



exists and satisfies the relations 



G{s) = I G(s + z) v{dz) , s > , lim — = 1 . (5.14) 



s 

Furthermore, for every 5 > there exists C such that the bound 

1(7 + s)P,,^ - G{s)\ < , (5.15) 

■-y 2 " 

holds uniformly for all s >0 and 7 > 1 V s. 

Proof. Denote by yk the fcth step of the random walk starting at the origin. Our main 
tool is the quantitative convergence result [Fra73], which states that the supremum 
distance between a Wiener process and the diffusively rescaled random walk over n 
steps is of order n^^^^. 

As a consequence we claim first that, for every 6 > there exists a constant C such 
that, for every a G [| , 3], we have the bound 



^"^■^ 1+a 



-y 2 " 



valid for every 7 > 1. Indeed, for any n > 1, it follows from the previously quoted 
convergence result that there exists a Brownian motion B such that \Bt -yit\ \ < 
for alH e [0, ^^] with probabihty greater than 1 — C/n'. Here, S > and g > 1 are 
arbitrary, but the constant C of course depends on them. 

Take n such that n^^^~^^ ^7- If 2/ hits [7,00) before (— c»,— 07], then either 
sup(<„ \Bt - uni I > n^/''+'', or supj<„ \Bt\ < 87, or B hits [7 - n^/*+^, 00) before 
it hits (—00, —07 — n^/*+^]. As a consequence. 



-Pa7,7 < , 1- — + exp(-cn/7 ) 



a7 + nV4+^ C_ 
(1 + a)7 ni 

Reversing the roles of 7 and 07, we thus obtain the bound 



^"^■^ 1+a 



<; \ h exp(— cn/7 ) . 

7 



Choosing 5 small enough and n = 7^+'', the claim (5.16) then follows. 

In order to obtain the convergence of the right hand side in (5.13), we make use of 
the fact that, for 7 > 7, one has the identity 

/>oo 

Ps^j — -^s,7 / -^8+7+2,7—7—2 i^^idz^ , (5.17) 
Jo 

where v-y is the law of the "overshoot" y„ — 7 at the first time n such that t/„ > 7, 
conditioned on never reaching below the level —s. Since Ps^^j^z,'i-i-z is an increasing 
function of z, we inmiediately obtain the lower bound 

-^s,7 ^ -^8,7-^8+7,7—7 5 
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If we choose 7 = 07 for a e [j, 5], it then follows from (5.16) that 

'7 + s C \ 



5,7 — ^ 5,7 ^ 



7 + S -yj 



for all 7 sufficiently large and uniformly over all s S [0, 7]. Setting Qs,-^ = (7 + s)Ps,'y, 
it thus follows that one has the bound 



Qs,^>Qs,^{l-^) , (5.18) 



possibly for a different constant C. Let 70 > 1 be such that the factor on the right 
of this equation is greater than 1/2. By (5.16), there then exists .so > 70 such that 
for s > Sq and 7 G [s, 2s] one has Qs^-y > C(l + s). Furthermore, for s < sq and 
7 e [(1 V s), 2(7o V s)], there exists a non-zero constant such that (5s,7 > C. Iterating 
(5.18), we then conclude that there exists a constant C > such that the bound 

> C(l + s) , 

holds uniformly over all s > and all 7 > 1 V s. 

On the other hand, for arbitrary a > 0, one has from (5.17) the lower bound 

-fs,7 — (^7({'^ T }) ~t~ -^5+7+7^,7— 7—7" )-Ps, 7 • 

In order to bound i'^({x > 7"}), we note that this event can happen only if either one 
of the first 7"^ increments exceeds 7", or the random walk never exceeds the value 7 
within these 7"^ steps. Similarly to before, it then follows that 

u^{{x > 7«}) < -^(73 exp(-c7«'') + 7-« + exp(-c7)) , 

for every q > and uniformly over s < 7. It follows from the lower boimd on Pg^-y 
obtained previously that Uj{{x > 7"}) < 7"' for any power g > 0, so that we obtain 
the upper bound 

'7 + s _^ C 

^ 7 + S J 

with the same domain of vahdity as before. Using a very similar argument as before, we 
obtain a constant C such that 7-Ps,7 < 0(1 + s) uniformly over s > and 7 > 1 V s. 
Combining the boimds we just obtained, we obtain 

C 

72 "T" 

uniformly over 7 > 7 > s, from which it follows immediately that the sequence 
{'yPs,'y}j>i is Cauchy, so that it has a limit G(s). 

It remains to show that G has the desired properties. The first one follows immedi- 
ately from the identity 

7-Ps,7 = / lPs+z,j-z Kdz) , 

which holds provided that we define the integrand to be 1 for z > 7 and for z < — s. 

In order to show that G(s)/s — > 1, we fix some (large) value s and choose 7„ = 2"s. 
It then follows from (5.16) that 

|Qo-s|<s^+', 



P.,<P.J^ + -Tz^), (5.19) 
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where we used the notation Q„ = (7„ + s)Ps -y^ as a shorthand. Furthermore, it follows 
immediately from (5.19) that there exists a constant C independent of s such that 
\Qn\ <Cs uniformly in n. As a consequence, we obtain the recursive bound 

Cs 

\Qn — Qn-l\ < • 
In 

Summing over n yields \Qn — s| < s^^^, uniformly in n, so that the claim follows. 
The quantitative error bound (5.15) follows in the same way. □ 

Corollary 5.2 In the same setting as above, one has the bound 



p _ 

5,7 



< 



7 + 5 

uniformly for all s > and 7 > 1 V s. 

Proof. Combine (5.15) with the bounds on G(s) — s obtained at the end of the proof 
above. □ 

Somewhat surprising is the fact that the function G obtained in the Proposition 5. 1 
does indeed satisfy (5.12), independently of the choice of transition probability v, 
provided that we assume that v has some exponential moment. 

Proposition 5.3 Let G be as in Proposition 5.1 and assume that the law v satisfies 

e'^^'^i'idz) <oo, 

for some c > 0. Then, one has the identity Jg°° z/([s, 00)) G{s) ds = |. 
Remark 5.4 Note that, by Fubini's theorem, 

/'QO /'OO /'S 

/ iyi[s,<x))Gis)ds= / / G{y)dyv{,ds), 
Jo Jo Jo 

so that we do obtain (5.12). 

Proof. Integrating (5.14) from to an arbitrary value K > and applying Fubini's 
theorem, we obtain the identity 



Jo 



K 

G(s) ds = I G(z) viiz - K, z]) dz . (5.20) 



In this proof we denote by I = G{z)v{[z, 00)) dz the quantity of interest. 
Simple algebraic manipulations then yield from (5.20) 

/■OO /"OO 

1=1 G{z)v{[z- K, 00)) dz- / G{z)u{[z- K,zJ)dz 
Jo Jo 

POO 

= / Giz)p([z- K, 00)) dz- / Giz)dz 
Jo Jo 

poo 

= / G{z){ui[z-K,<x^))-l,<K)dz. 
Jo 
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Since this identity holds for every ii' > 0, it follows in particular that one has 

/•oo 1*00 

X = s G(z) / - K, 00)) - l^<if)e-^-^ dK dz 

Jo Jo 

= G{z)(e u([z - K,oo))e-'^ dK - e-^'^'^ dz , (5.21) 
for every £ > 0. At this stage, we note that one has the identity 

/■oo poo 

£ / viiz - K, oo))e-'^ dK = e-"Ee^« - se"" / e^^ui[K, 00)) dK , 

Jo J z 

where ^ denotes an arbitrary random variable with law v. Since v has some exponential 
moment by assumption, u{{K, 00)) decays exponentially so that the second term in this 
identity satisfies 



/■oo 

£6"" / e^^v{[K, 00)) dK 



< Gee-""' , 



for some constants 7, C > 0, provided that s is small enough. Inserting this into (5.21), 
it follows that 

/■oo 

I = / G(-2)e-"E(e^« -l)dz + 0{e) . 
Jo 

At this stage, we use again the fact that ^ has exponential moments to deduce that 

Ee^« - 1 = — + 0{e^) , 
where we used the fact that E^ = and E^^ = 1, so that 

_2 /■oo 



/ 

Jo 



I = ^ / G(0)e-^^ dz + 0(s) . 



It then follows from the fact that lims_).(x, G{s)/s = 1 and the dominated convergence 
theorem that 

£2 r°° 1 

I = lim — / ze-" dt=- , 

e^O 2 Jo 2 

which is precisely the desired expression. □ 

Remark 5.5 It is clear that these results should hold under much weaker integrability 
conditions on i/. However, since we need some exponential moments on ly at several 
places in the sequel, we did not try to improve on this. 

5.2 Some properties of the limiting process 

In this section, we provide a rigorous definition of the limiting process loosely defined 
in Section 5.1.1, and we study some of its properties. In order to be able to use 
existing results on Brownian excursions, we restrict ourselves to the same situation as in 
Section 5.1.2, namely the case where the imderlying diffusion is a Brownian motion 
and the potential V{x) = —ax is linear. We call the resulting object the Brownian fan. 
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5.2.1 Recursive Poisson point processes 

Before we give a formal definition of the Brownian fan, we define a "recursive Poisson 
point process". Loosely speaking, this is a Cnimp-Mode-Jagers process [Jag75] with 
Poisson distributed offspring, but where the number of offspring of any given individual 
is allowed to be almost surely infinite. Note again that our construction is very similar 
to the one given in [Hut09]. Given a Polish space X and a function F: X R_|_, we 
denote throughout this section by M^(X) the space of u-finite measures nonX such 
that 

m(F-1(0)) = , ^{{x : F(x) > e}) < oo , 

for all e > 0. We endow this with the topology of convergence in total variation on each 
set of the form {x : F{x) > 1/n}. Given a (measurable) map Q from X to M^{X), 
we can then build for every x G X a point process as follows. 

Define = and, for n> I, define /u^ recursively as a (conditionally independent 
of the /U^ with i <n) realisation of a Poisson point process with intensity measure 



Qn= [ Qiy) f^2-\dy) , 
Jx 



where we view /u" as a random (T-flnite positive integer- valued measure on X. (In 
principle, it may happen that Q„({a; : F{x) > e}) = oo for some e > 0. In this case, 
our construction stops there.) 



When then set 



,[n] _ 

^=0 



and we call ^t,"^ the recursive Poisson point process of depth n with kernel Q. We wiU 
occasionally need to refer the Brownian fan spawned by an initial Brownian motion 
w. For this purpose we will use the symbol fil^^ (or fi^ for a specific generation) and 
rely on the context to differentiate /x^^ and /xfj^^. If, in these symbols we omit the 
subscript entirely then it is assumed that a; = 0. In general, there is no reason to expect 
the sequence /U^^ to converge to a finite limit. However, one has the following simple 
criterion ensuring that this is the case: 

Lemma 5.6 Let F and Q be as above and assume that there exists c < 1 such that 
J F(y)Q{x,dy) < cF(x) for every x € X. Then, for every x G X, there exists a 
random a-finite measure on X such that lim„^oo E / F{y){y}-^^ — /x!^"l)(dy) = 0. 

Proof. Fix X G X and denote by F„ the cr-algebra generated by /x^"'. It then follows 
from the definition of the /U^"^ that 

e(/ Fiy)ni+Hdy) F^) = / / F(y)Q(z,dy) ni(dz) < c [ F(z) fxi(dz) . 
^Jx ' Jx Jx Jx 

As a consequence, one has E F{y) n^idy) < (^F{x), and the claim follows. □ 

Remark 5.7 A useful identity is the following. Denote by {jj^^^yex a collection of 
independent copies of recursive Poisson point processes with "initial conditions" y. 
Then one has the identity in law 

mL°°^=^x+ / ^li{dy) , (5.23) 
Jx 
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where, as before, is a realisation of a Poisson point process with intensity Q(x, •), 
which is itself independent of the This identity makes sense since the integral on 
the right is really just a countable sum. 

5.2.2 Construction of the Brownian fan 

We now denote by £ the set of excursions with values in R. We consider elements of 
£ as triples (s, t, y) Where s < i G R U {+00}, and y G C(R, R) has the property that 
Vt = yt for t > t and = Us for r < s. We also write Sq for the subset of those 
triples (s,t,y) such that s = 0. 

Denoting a generic excursion by w, we write 5(w) for its starting time and z{w) for 
its end time, i.e., s(s, t, y) = s, and e(s, t, y) = t. We also denote by {{w) the lifetime 
of the excursion, which is the interval \{w) = [s(w), e(w)]. In order to keep notations 
compact, we will also identify an excursion with its path component, making the abuse 
of notation Wt = yt- There is a natural metric on £ given by 

d(iZ;,w) = d[(w,M;)V V2"''(lA sup \wt - Wt\) , (5.24) 
t^i ^ l*l<2'= ^ 

where the distance d( between the supports is given by 

d[(w, w) — 1 A {\s(w) — s(w)\ + I tanh e(w) — tanh e(w;)|) . 

The reason for this particular choice of metric is that it ensures that f is a PoUsh space, 
while still allowing for infinite excursions. 

For T e R and v G £,we denote by Qr,v : ^ ^ ^ the shift map given by 

Qr,v ■ (S, t,w)l-^ {S + T,t + T, W.+r + Vt) , 

which essentially changes the coordinate system so that the origin (0, 0) is mapped to 
(r, Vr). Denoting as before by Q the standard Ito exclusion measure, we now give the 
following definition: 

Definition 5.8 The Brownian fan with intensity a > is the recursive Poisson point 
process on £ with kernel 

Qiw,-)=^ / e;^Qdr, (5.25) 

and initial condition given by a realisation of Brownian motion, starting at the origin 
and killed when it reaches the level —L, where L is exponentially distributed with mean 
a. 



Remark 5.9 The reason for killing the original Brownian motion at this particular level 
is natural, due to the distribution of the initial tag in Step 1 of the algorithm. It is 
however essentially irrelevant to the mathematical construction. 

Remark 5.10 Formally, the Brownian fan is a particular case of the Virgin Island 
Model [Hut09] with a playing the same role in both models, h = a, and g = 1/2. The 
differences are twofold. First, the case of constant non-vanishing a actually doesn't fall 
within the framework of [Hut09] since the author there uses the standing assumption 
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Figure 6: A typical realisation of the Brownian fan with intensity a = § over the time 
interval [0, 1]. Successive generations are draw in Ughter shades of grey. 

that a(0) = 0. The other difference is mostly one of perspective, while we have so 
far defined the Brownian fan as a point process on a space of excursions, one of the 
purposes of this article is to show that it is also well-behaved as an actual Markov 
process with values in a suitable state space of (possibly infinite) point configurations. 

Remark 5.11 By only keeping track of the genealogy of the particles and not their 
precise locations, one can construct a "real ttee" on top of which the Brownian fan is 
constructed (loosely speaking) by attaching a Brownian excursion to each branch. This 
is very similar in spirit to Le Gall's construction of the Brownian map [LG07, LGIO], 
starting from Aldous's continuous random tree [Ald91]. The scaling properties of the 
Brownian fan however are quite different. In particular, Theorem 5.13 below implies 
that the underlying tree has Hausdorff dimension 1, as opposed to the CRT which has 
Hausdorff dimension 2. 

Before we proceed, let us show that it is possible to verify the assumptions of 
Lemma 5.6, so that this object actually exists for every a > 0: 

Proposition 5.12 The kernel Q defined in (5.25) satisfies the assumption of Lemma 5.6 
with the choice 

provided that t] is large enough. 

Proof. It follows from the properties of Q that there exists a constant C independent of 
7 and a such that 

f n r^(^'> r°° 

/ F{w)Q{w, dw) = —= / e""^ / (1 - e^"'')s-=^/2 
Je vStt Js(w) Jo 
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where C is a constant independent of a and rj. The claim then follows by choosing 

^> Ca. □ 

The remainder of this section is devoted to a study of the basic properties of the 
Brownian fan. In particular, we will show that there exists a suitable space X of particle 
configurations such that it can be viewed as a A* -valued Markov process with continuous 
sample paths that satisfies the Feller property. 

5.2.3 Number of particles and workload rate 

Define the set Aft C £ of excursions that are "alive at time t" by 

N't = {w €£ : s(w) <t< e(w)} . 

With this notation, the number of particles alive at time t for the Brownian fan is given 
by 

which is in principle allowed to be infinite. 

Theorem 5.13 There exist a constant C > and a strictly positive continuous decreas- 
ing function A: R+ — )• R+ such that 

Eexp(AtJVt) < C, (5.26) 

holds uniformly over all t> 0. 

Remark 5.14 We will see in the proof that one can choose A of the form 

At = K-^e-^* , (5.27) 

for K sufficiently large. 

Proof For A > and s, t > 0, set 

7V,^,, =logEexp(A/4r'(M)), 

where w is any excursion starting from with hfetime s. Since, by the definition (5.25), 
the value of N^^^ does not depend on the precise choice of excursion, we do not include it 
in the notation. It also follows from the construction of fj^^^ that the function 1 1— )• N^^ 
is increasing in t and that N^^ = N^^ for s > t. 
We also define M/* by 

= log E exp (a ^ fi^^\Aft) M(dw)) , (5.28) 

where M is a Poisson random measure with intensity measure Q and the realisations 
/i^^^ are independent of Q and of each other. While measures the total number of 
offspring alive at time t due to an excursion starting at time 0, measures the rate at 
which these offspring are created. 
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Indeed, combining (5.23) with the definition of /x^ and the superposition principle 
for Poisson point processes, we have the identity 

N^^ = Xls>t + a / Ml^ dr . (5.29) 

It therefore remains to obtain suitable bounds on M^. 

It follows from (5.29) and standard properties of Poisson point processes (see for 
example [PZ07, Theorem 6.3]) that one has the identity 



Mt^ = J {Eexp{Xij,^^\Aft))-l)Qidw) = J {e^^^'^""^'*^ - l)Qidw) 
< (exp (Xl^^ (w) + aj^ M^_^ ds) - 1 j Q(dw) . 



At this stage, we note that the integrand appearing in this expression depends on w only 

through e(w). It is then convenient to break the integral into a contribution coming from 
t{w) > t, as well as its complement. Since, under Q, the quantity t(w) is distributed 

— 3/2 

according to the measure ds, this yields 

< (exp(A + a j dr) - l) Q(e(«;) > t) 

J 

(exp(^ 



Mjl^ dr]-l] ^= ds 



We now assume that both A and t are sufficientiy small so that 

A + a / M^dr<\. (5.30) 
Jo 

This assumption allows us to use the bound e' — 1 < 2t, so that we obtain the more 
manageable expression 

ds 



<^—[X + a \ M^dr)+2a / M^^^dr 
\/27r ^ Jo ' io Jo 



2-K 



+ — ^ / M;' dr + 2a / M;^ dr ^ — ds 

V2tt V2^ Jo Jo Js V2tt 



< 



V27r V27r 
4At-V2 8a 



/ dr + -= / (t - s)-^/2 Mj^ rfs 
Jo v27r Jo 



\/2^ io 



/27r 

Writing ifj^ = t^l'^Ml, we thus obtain the boimd 



* " ^f2^ sf2^ Jo 

We can now apply the fractional version of Gronwall's lemma [NR02, Lemma 7.6] 
(with b = a = -4=, and a = h), so that there exists a constant C > depending 
on a but independent of A such that 

< CAexp(CQ . 
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From this, we immediately deduce from (5.29) and the definition of a similar bound 
on N^^. Choosing A = K~^e~'^'' for sufficiently large t then allows to satisfy (5.30) 
and to obtain A^^^^ < 2, thus completing the proof. □ 

As a corollary, we obtain a rather sharp bound on the modulus of continuity of the 
total workload process Wt = p^Nsds. One has 

Proposition 5.15 For every T > 0, one has 

sup hm — — — - — - < 00 , (5.32) 
t<Th^O h\\ogh\ 

almost surely. 

Proof. It follows from the generalised Young inequality that, for every a,h & R+, and 
every A, > 0, one has the inequahty 

a6 < Y (e^" - 1 - Aa + (1 + b/r}) log(l + 6/r?) - h/rf) , 
A 

so that 

Nt < ^(e^^* + (1 + l/77)log(l + 1/r/)) . 
It follows immediately that 

- W,| < ^ f^^' e^^» ds + logd + 1/ry) . 

Setting 7/ = /i, we obtain the bound 

m+h -yVt\<\ e^^= ds + Cxh\ log h\ , 



uniformly over all /i < 1 and all f € [0, T]. The claim now follows immediately from 
Theorem 5.13. □ 

Although the number of particles alive at any deterministic time has exponential 
moments, there exists a dense set of exceptional times for which Nt = oo. For one, this 
follows from the fact that, under Q, e(w) is distributed proportionally to s~^^'^ds, so 
that every particle creates an infinite number of offspring in every time interval. 

Actually, one has the even stronger statement that there is a dense set of exceptional 
times at which the number of particles belonging to the first generation of offspring 
is infinite. Indeed, if we denote by M a Poisson random measure on with density 
cs"''/^ dr ds for a suitable constant c, then the number of particles in the first 
generation of offspring is given by 

Nl = MiAt) , At = {(r,s) e [0,t] x R+ : s > t - r} . 

For A; > and d e {1, . . . 2'=}, we then set 

Ak,d = [(rf- l)2-^(^2-'=] X [4-^4l-'=] , 
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so that, by the scahng properties of M, the random variables Nk.d = M(Afe.(i) form a 
sequence of i.i.d. Poisson random variables. For any given point (r, s), we set -B(r,s) = 
[r, r + s], which is the set of times t such that (r, s) G At, and we set 



Since the set D(^r,s) is infinite for every (r, s), we can then build a sequence (r„, s„) 
recursively in the following way. Start with (ro, sq) = (0, 1) and then, given (r„, s„) for 
some n > 0, define (fc„, rf„) as the first (in lexicographic order) element (k, d) G -P(r„,s„) 
such that NkM > 1- We then set (r„+i, Sn+i) to be one of the points of M located 
in Ak^^dn- construction, one then has r\n>iB(r„.s„) = {t} for some t € [0, 1], 
and M{At) > M{Ak,^,drJ = oo, as stated. Of course, the interval [0, 1] in this 
procedure is arbitrary. If we want to show that there exists an exceptional time within 
any deterministic time interval [to, ii], it suffices to start the algorithm we just described 
with ro = to and so =ti — to- 

5.3 The Brownian fan as a Markov process 

In this section, we slightly shift our perspective. We do no longer consider our process 
as a point process of excursions, but we consider it as an evolving system of particles. 
Our system will therefore be described by a Markov process in some space of integer- 
valued measures on a subset of corresponding to the admissible combinations of 
"position + tag". The problem is that, as we have seen in the previous section, there are 
exceptional times at which the limiting process consists of infinitely many particles. 
The first challenge is therefore to construct a space X of integer-valued measures with 
a sensible topology which can still accommodate these "bad" configurations in such a 
way that the limiting process is continuous both as a function of time and as a function 
of its initial configuration. 

Once this space is defined, we show that the Brownian fan possesses the Feller 
property in X (i.e. the corresponding Markov semigroup leaves the space of bounded 
continuous functions invariant). In fact, we will show that it preserves the space of 
Lipschitz continuous functions. The continuity property of the discrete-time process 
established below in Proposition 6.8 also implies the time continuity of the Brownian 
fan (see Theorem 6.11 below). These properties allow us to conclude that the Brownian 
fan 1 1-^ fxt is in fact a strong Markov process. 

In Section 5.3.4 below, we will furthermore compute its generator A on a class of 
"nice" test functions. We will also present a "negative" result showing that if we denote 
by the one-step Markov operator corresponding to the evolution of Algorithm 2, then 
one has A ^ lime_j.o e~^(T'e — 1). This is in stark contrast with, for example, Euler 
approximations to stochastic differential equations, where such an equality would hold, 
at least when appUed to sufficiently regular test functions. 

5.3.1 State space 

Our construction is essentially the Wasserstein-1 analogue of the construction given in 
[FGIO]. Let Ai C R" be a convex open set with boundary dA4. For p G (0, 1], we 
then denote by i^iM) the set of all integer-valued measures iionM such that 



D(r,s) = {(k,d) : Bi^r'.s') C B(r,s)V(r',s') G Ak^a] ■ 




(5.33) 
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where d(y, dAi) denotes the (Euchdean) distance from y to the boundary of Ai. Note 
that since this quantity vanishes at the boundary, there are elements fx € £P{M) such 
that fJ,iM) = oo. 

We endow £p{A4) with a slight modification of the Wasserstein-1 metric by setting: 

Wli - = sup ( / f{y)fi{dy) - [ fiyMdy)) , (5.34) 
/eLip°(M) J 

where we denoted by Lipp(A^) the set of all functions f:A4^R such that 

\f(x) - f(y)\ <\x- y\P , (5.35) 
for all X, y e M, and /(y) = for all y e dM. 

Remark 5.16 Our notation is consistent in the sense that if we take for v the null 
measure in (5.34), then we precisely recover (5.33). This can be seen by taking f(x) = 
d^ix, dM), which is optimal by (5.35) and the triangle inequality. 

If /It and V happen to have the same (finite) mass, then the expression (5.34) does 
not change when one adds a constant to /. In this case, we are thus reduced to the 
usual Wasserstein-1 distance between ^ and v, but with respect to the modified distance 
function 

dpix, y) = \x- y\P A {dFix, dM) + d^iy, dM)) . 

Note that the completion of M under the distance function dp consists of MU {A}, 
where A is a single "point on the boundary" such that dp{x, A) = dP{x, dM) for every 

X e M. 

If one has niM) < v(M) < oo, then the distance || • \\p reduces to the Wasserstein- 
1 distance (again with respect to dp) between /l and v, where Ji is obtained from 
fx by placing a mass u{M) — n(M) on the boundary A. The following alternative 
characterisation of (5.34) in the case of purely atomic measvu^es is a version of the 
Monge-Kantorovich duality in this context: 

Lemma 5.17 Consider a situation where fi = J2iLi ^x, cmd v = Then, 

N+M 

IIm-J^IIp= inf ^ <P{Xi,y„(i^) , 

where Sn+m is the group of permutations of N + M elements and we set Xj = A for 
j > N and yj = Aforj > M. 

Proof. See for example [Pra08]. □ 

This characterisation suggests the following "interpolation" procedure between 
elements in £''(M). Let ^ = ^^^iSx^ and v = J2iLi^yi^ where we assumed that 
both measures charge the same number of points (this is something that we can always 
achieve by possibly adding points on A), assvune furthermore that these points are 
ordered in such a way that 

N 
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Again, this can always be enforced by suitably reordering tiie points and possibly adding 
points on the boundary. We then define, for t S (0, 1), the "hnear interpolation" Ltin, ^) 
by 

N 

LtijJL, I') = ^ 5zi , Zi = ty^ + (1 - t)xi . (5.36) 
j=i 

Note that this procedure is not necessarily unique, but it is easy to resolve this ambiguity 
by optimising over the possible pairings {ixi,yi)} realising the above construction, 
according to some arbitrary criteria. 

In any case, one can check that this construction then has the property that 

\\Lsil^, - Ltin, i/)\\p <\t- sfWiJ, - iy\\p , (5.37) 

for any s,t G [0, 1], which will be a useful fact in the sequel. 

5.3.2 Definition of the process 

For the remainder of this section, we set 

M = {(x, v)€R'^ : V > -ax} , 

which is the natural configuration space for our process. We will use capital letters to 
distinguish points in Ai from points in R. By Theorem 5.13, we already know that, for 
any fixed time t, the Brownian fan almost surely has only finitely many particles aUve at 
time t. Define now the evaluation map Efi £ ^ MU {A} by 

E(w) = ^ ^'^*' ~'^'^«(«')) ^ ^ 
* I A otherwise. 

For a given "initial condition" (x, v) G M, we then set 

which is an £P(A^)-valued random variable. Here, w is a realisation of a Brownian 
motion starting at x and killed when it hits —v/a, and is the corresponding 
realisation of the Brownian fan. (Just so that Et has the correct effect on w, one can 
for example set s{w) = —1 and make sure that w(—l) = —v/a.) As a consequence 
of Theorem 5.13 and of our definition of the Brownian fan, we then indeed have 
lit e fiM) for every p < 1. 

Note at this stage that we can simply discard the, typically infinite, mass on A 
by identifying measures that only differ on A. As already mentioned earlier, this is 
consistent with the identification A ~ dA4 already made in the interpretation of the 
construction of PiM). 

This construction can be extended to any initial condition in with finite total 

mass, by considering independent Brownian fans for each particle. As a consequence 
of the Markov property of the Brownian excursion and the independence properties 
of Poisson point processes, it is then straightforward to verify that t ^ fit is indeed a 
Markov process. 

Actually, by Proposition 6.2, we know that for any fixed collection of deterministic 
times {<!,... ,tk}, one has jitkiM.) < oo almost surely, so that our construction 
determines a probability measure on (£^'(j\4))^+ by Kolmogorov's extension theorem. 
At this stage however, we know absolutely nothing about the continuity properties of 
this process, and this is the subject of the remainder of this subsection. 
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5.3.3 Feller property 

We now show that the Brownian fan constructed in Sections 5.2.2 and 5.3.2 has the 
Feller property in £P(M.) for every p < 1. 

As in [FGIO], we could have defined spaces £P(M) is a natural way for p > 1. 
However, the Feller property would fail in this case because of the following simple 
heuristic argument. For any p, we can change the initial condition by an amount less 
than S in by creating TV particles at distance e = iS/N)^/^ from the boundary of M. 
For c small, the probability that any such particle survives up to time 1 (say) is bounded 
from below by ce for some c > 0. On average, the number of survivors will thus be 
of the order of eN ~ §^/pn^~^^/^ . Furthermore, at time 1/2, each of these surviving 
particles will be at a distance of order 1 of the boundary of M. As a consequence, by 
increasing N but keeping 6 fixed (or even sending 5 to sufficiently slowly), the law of 
the process at time 1 with an initial condition arbitrarily close to can be at arbitrarily 
large distance of 0, so that the Feller property fails. 

For p < 1 on the other hand, we have 

Proposition 5.18 For any p < 1, the Brownian fan gives rise to a Feller process 
in £P(A^). Even more, the corresponding Markov semigroup preserves the space of 
bounded Lipschitz continuous functions. 

Proof. For any two initial conditions /xq and jlo, write as before 

N N 

with the Xq'' e M. and Xq^ e M. chosen in such a way that 

N 

IImo - M\p = Yl (^o'^ ^o'') • (5.38) 

Our aim now is to show that there exists a constant C such that 

E||/Ut - Mtllp < C^IImo - PoWp , 

independently oit < 1, where the pair (fj.i,p.i) is any coupling between the Brownian 
fans starting from /iq and jlo respectively. Denote by /i^-'^ the contribution to fit originat- 
ing from the initial particle X^^ and similarly for jl^^K Then, by the triangle inequality, 
one obtains the bound 

so that the claim follows from (5.38) if we can show that 

ni^Y>~fi^X<cd,(xi^\xi^'). 

In other words, it suffices to consider the special case when both /xq and p,o consist of 
one single particle, which we denote by Xq = {xq, vq) and Xq = (xq, vq) respectively. 

One then constructs a coupling between the two processes fit and fit by running 
both particles with the same Brownian motion and spawning children according to the 
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same Poisson process (as long as the corresponding particle is alive). We denote by Xi 
and Xt the evolutions of the two initial particles in M, driven by the same realisation 
of a Brownian motion, and stopped when they reach dM. We can assume without loss 
of generality that uo + axi^ < vq + o^x^i, so that the particle X dies before the particle 
X. Denoting by r and f the respective lifetimes of these particles, one thus has r < f. 

Denote now by Mj the (random) measure on [0, x which is such that, for 
/ C [0, t] and A C M, Mi (I x ^4) is the number of particles in A at time t that are 
offspring of a particle created from the "ancestor particle" X or X at some time s G /. 
With this notation, if we denote by E: M ^ A4 the map 

E(x, v) ^ {x + xq - Xq, V - a{xo - Xq)) , 

then one has the decompositions 

MA) = lr>tSxM) + S*Mt([0, TAt]xA), 
MA) = lf>tSxM') + M*([0' fAtjxA). 

Denote now by 5 the Euclidean distance between the two initial particles, so that their 
£P-distance is 5^. It then follows immediately from the above decomposition that one 
has the bound 

llMt - At Hp < + lte[T,f]dpiXt, dM) + 5PMti[0, TAt]xM) 

+ I dpiY, dM)MtilT At,f At]x dY) . 
Jm 

Since d(X{T)^ dM.) = (5 by the definition of r and 5, it follows from Jensen's inequality 
and the Martingale property of (stopped) Brownian motion that one has the bound 
^lt(^[r,f-\dp{Xt, dM) < SP. It also follows from Theorem 5.13 that EMt([0, r A x 
M) < oo, independently of 5. Finally, it follows from an argument very similar to the 
proof of Theorem 5.13 that 

E / dpiY, dM)Mtids x dY) < C ds , 
Jm 

uniformly over s e [0, i]. It follows that 

E / dpiY,dM)Mti[TAt,fAt]xdY)<CE\fAt-TAt\<C6, 
Jm 

where we used the fact that if ts is the first hitting time of by a Brownian motion 
starting at S, then E{ts A 1) < C6. Combining these boimds completes the proof. □ 

5.3.4 Lack of convergence of the generators 

One standard method to prove convergence of a sequence of Markov processes to a 
limiting process, once tightness has been established, is to show that the corresponding 
generators converge in a suitable sense. In our situation, one actually does not expect 
the generator of the approximate process to converge to that of the limiting process, 
when testing it on "nice" test functions. We first argue at a technical level why this is 
the case, before providing an intuitive explanation. 

Inspired by [EK86, Daw93], we consider test functions of the form 



F(/x) = exp((log/,/x)) , 



(5.39) 
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where f : A4 ^ R+ is a sufficiently smooth function such that f(x, v) = 1 for (x, v) G 
dM. This boundary condition is required since elements /z € i^iM) can have infinite 
mass (and, as we have already seen, the limiting process really does acquire infinite 
mass at some exceptional times) accumulating near dA4. Being in £P{Ai) forp < 1 
does however ensure that smooth functions such as above are integrable. 

Exploiting the independence structure of the process as well as its space homogeneity, 
we can reduce ourselves to the case of an initial condition of the form /ig = ^(x.v) for 
some V < —ax. In this case, for sufficiently small £ > 0, the probability that the 
original particle dies within the time interval s is of the order for any p. We therefore 
only need to take into account the possibility of creating some descendant(s), with 
the killing mechanism being taken care of by the boundary condition of /. While the 
average number of "second generation" descendants is of order s, any such descendant 
will typically have travelled to a distance of order y/e from dA4, so that only the first 
generation has a chance of contributing to the generator. 

We then have 

^ {EF(n,) - F(no)) « Aofix, V) + ^^^E{f - 1, M,) , 

£ £ 

where 

is the generator of Brownian motion, and where is the (projection to time £ of 
the) Poisson point process yielding the first generation of offsprings. Note now that 
since these offspring will be created near dM and since / = 1 there, we can further 
approximate this expression by 

- (EF(/x,) - F(/xo)) « Aofix, V) + ^^^f'ix) E [ (x + a-'v)M,idx, dv) . 

Here, we wrote f'{x) as a shortcut for dx,f{x, v)\^^_^^. Denoting by Cs the position, 
relative from its starting point, of an excursion of length s and making use of the formula 
(5.25) for the intensity measure of M^, we obtain for the last term in this equation the 
expression 

-E (x + a-^v)Me(dx,dv)= / / Ee^it) dt s-^'"^ ds . (5.40) 

e Jm 2V2ne Ja Jo 

At this stage we note that, for a Brownian excursion of length s, we have foit<s the 
identity 

Ee,(i) = J—{s - t) . 

V TTS 

Inserting this into (5.40), a tedious but straightforward calculation then yields 



hm / {x + a~^v)Ms{dx, dv) = " 
£ Jm 2 



so that we finally obtain for the generator A the expression 

AF(Mo) = Aifix, V) + p{x, v)f'{x) . (5.41) 

Recall that this is for the particular case where no = 5(x,v)- In the general case, we can 
use the independence structure of the process to obtain 

AFiuio) = FiuLo) I (^^^^^ + ^fix)) fj^oidx, dv) . 
Jm^ f(x,v) 2 ) 
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Remark 5.19 Compare this with the generator of a usual branching diffusion, where 
the term fix) would be replaced by a(f(x) — 1), with a the branching rate. 

On the other hand, if we denote by Tg the Markov operator describing one step of 
Algorithm 2, we might expect that one also obtains A as the limit (T^ — 1) as e ^ 0. 
This would indeed be the case if there was no branching or if branching did only occur 
at a finite rate. Considering again initial conditions of the form ;Uo = S(x,v) for some 
V < —ax. By reasoning similar to that in the beginning of Section 5, we obtain 



which is always dijf event from (5.41), and is actually what we would have obtained from 
the wrong guess (5.7). 

A possible reason for this discrepancy is that, while the Markov semigroup of the 
Brownian fan does indeed preserve test functions of the type (5.39), we do not expect 
this to be true of the Markov operator Tg. Instead, the "correct" space of lest functions 
for is of the same type, but the function / should have a "boundary layer" near dM. 

Remark 5.20 Another reason why the generator of the Brownian fan is not such a 
useful object is that many seemingly innocent observables, Uke for example the total 
number J\f of particles, do not belong to its domain. This follows from the fact that if 
we consider again a simple initial condition /io as above, then EA/^(/ig) — 1 « ^/e for 
small e. The reason why the total number of particles nevertheless remains finite (at 
least for fixed times) is that there are exceptional states where one or more particles are 
very near dM and for which EAA(/ie) - 1 « -0(1). 

6 Convergence to the Brownian fan 

The aim of this section is to provide a rigorous proof of the fact that, in the situation of 
the previous section, the process given by Algorithm 2 converges to the Brownian fan in 
C([0, T},IP{M)) for any T > and p < 1. The overall strategy of the proof is classical: 
we first prove a tightness result in Section 6. 1 and then show that finite-dimensional 
marginals converge to those of the Brownian fan in Section 6.2. 

Difficulties arise on two fronts. First, to prove the tightness result, it is convenient to 
have uniform moment bounds on the number of particles at fixed time for the approxi- 
mating system. These turn out to be much more difficult to obtain for the approximating 
system than for the Brownian fan, which is mainly due to a lack of uniform exponential 
bounds. A second difficulty arises in the proof that finite-dimensional distributions 
converge to those of the Brownian fan. While it is intuitively clear that those excursions 
that survive for times of order 0(1) do converge to suitably normalised Brownian ex- 
cursions, this result is rather technical and, surprisingly, does not seem to appear in the 
literature. Furthermore, no convergence result holds for the typical excursions which 
die very early. We therefore also need to argue that, both at the level of Algorithm 2 and 
at the level of the Brownian fan, these small excursions do not matter in the hmit. 

6.1 Tightness 

As in the previous section, we restrict ourselves to the particular case when the underly- 
ing Markov process is given by a rescaled random walk, namely 




(5.42) 



y(k+l)e = Uke + Ve^fe+l , 



(6.1) 
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where the are i.i.d. random variables with distribution v having some exponential 
moments, and where the potential V is given by a linear function, V(x) = —ax. Our 
aim is to show that as e ^ 0, the sequence of birth and death processes obtained by 
running Algorithm 2 is tight in a state space X, which we will now describe. 

6.1.1 Formulation of the tightness result 

With the construction of the previous section in mind, we choose as our state space 
X = £P{M), where M = {{x, v,n) G x N : v > —ax}, and p < 1 is arbitrary. 
Here, the coordinate n is used to keep track of the generation of a particle: direct 
offspring of a particle from the nth generation belong to the (n + l)st generation. We 
extend the Euclidean distance to x N by additionally postulating that the distance 
between particles belonging to different generations is given by the sums of the distances 
of the two particles to dAi. The boundary dAi is given as before by dAi = {(x,v,n) : 
V = —ax}. We will use capital letters for elements of A4 to differentiate them from 
elements of R. 

In order to formulate our result, we will make use of the following notation. For 
t = ke with k an integer, we denote by /if the empirical measure of the particles alive 
at time t, and by Nt the number of such particles. Sometimes, it will be convenient to 
consider the particles instead as a collection of elements in X'^-' ^ e M, so that we write 



We do not specify how exactly we order the particles, as this is completely irrelevant 
for our purpose. For t G {ke, {k + 1)£), we define /if by using the "hnear interpolation" 
procedure (5.36), setting 



The interpolation procedure Lg is a very minor modification from the one described 
above, in the sense that we only connect particles belonging to the same generation. In 
this way, the process f /if has continuous trajectories for every e. The main result of 
this section is as follows: 

Theorem 6.1 Let p < 1 and denote by the law of the process t ^ described 
above, viewed as a family of probability measures on C([0, 1], X). Assume furthermore 
that there exists c > such that J e'^^^^vidy) < oo. 

Then, for any single particle initial condition /iQ = Sxg with Xq g Ad, there exists 
£o > such that the family {£'^}e<so tight 

Proof. Combining [Daw93, Theorem 3.6.4] and [Bil99, Theorem 8.3], we see that, in 
order to obtain tightness, it is sufficient to show that: 

• For every S > 0, there exists a compact set Kg <z X such that P(/if e Ks) > 
1 — d, uniformly over t G [0, 1] and e < sq. 

• There exists a > and C > such tiiat E||/if - /i|||9 < C\t - s|"«, uniformly 
over all s, f G [0, 1] and e < Sq. 

The first claim then follows from Proposition 6.7 below, while the second claim is the 
content of Proposition 6.8. □ 




s = e — k . 
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The proof of this resuh is the content of the remainder of this subsection and goes 
roughly as follows. In Section 6.1.2, we obtain a moment bound on the number of 
particles alive at any fixed time t G [0, 1], which is uniform in e > 0. This then allows to 
obtain the compactness at fixed time in Section 6.1.3. The verification of Koknogorov's 
continuity criterion is the content of Section 6.1.4. 

6.1.2 Moment bounds on the number of particles 

In the sequel, we wiU denote by Kp(X) the pth cumulant of a random variable X and by 
Kp{X I F) the same cumulant, conditioned on the tr-field F. We will use the important 
property that Kp(X + Y\F) = Kp(X \ F) + Kp(Y \ F), provided that X and Y are 
independent, conditionally on F. We will also use the fact that if X is a positive random 
variable, then Kp(X) < EX^ and there exists a constant C such that the bound 

EXP < C ^(KgCX))^/" , (6.2) 

holds. 

Our aim now is to obtain a bound on the cumulants of the number of particles alive 
at time t which is independent of e. We start with an initial configuration containing 
only one particle, which belongs to generation 0, and we set e {0, 1}, depending 
on whether or not this particle is still alive at some subsequent time t. We also define 
N" = where M" = {(x,v,ti e M : £ = n}, which is the number of 

particles in the nth generation that are alive at time t. We also denote by A^"j the 
number of such particles that were created at time s <t. 

For any Xq = (x, v) with v > —ax, we write Ex„ for expectations of observables 
for the process generated by starting Algorithm 2 with underlying dynamic (6.1), started 
with one single initial particle in generation at location x with tag v. Finally, for 
a; € R, we write for the same expectation, but where the initial particle has tag 
V = +0O, meaning that it is "immortal". We then have the following result: 

Proposition 6.2 Consider the situation of Theorem 6.1. For every p > 0, there exist 
Cp > and £o > such that, under the rules of Algorithm 2, the number Nt = ijlI{M.) 
of particles alive at time t satisfies Ex\NtY' < 2exp(CpQ, uniformly overall e < eq 
and a; e R. Furthermore, there exists q>Q such that 

^xN^ <{n+ l)*/e2-" , (6.3) 

uniformly over all e < Eq, n > 0, and i > 0. 

Finally, denoting by R] the number of offspring alive at time t that have never been 
at distance more than ^from dA4, we have the bound 

uniformly over t G [0, 1], 7 € (0, 1], and e < £0 A 7^. 

The proof will make use of the following elementary fact where, for A = n + p 
with n e N and p G (0, 1], we denote by 1(A) the law of a random variable Y such 
that Y = n with probability 1 — p and Y = n+1 with probabiUty p. (This is so that 
EY = A.) 

Lemma 6.3 Let Y be a random variable with law 1(A). Then, for any q> 1 and any 

A > 0, one has the bound Er« < A + (2A)«. 
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Proof. By inspection, one has 

Ey« = p{n + 1)9 + (1 - p)n« . 

In the case n = 0, one then has EF^ = \^ so the statement is trae. For n > 1, one uses 
the fact that both n + 1 and n are boimded by 2A, and the claim follows at once. □ 

Proof of Proposition 6.2. We restrict ourselves to times that are integer multiples of e. 
Furthermore, from now on, we fix an initial condition x, so that we just write E instead 
of Ea;. We also denote by F" the cr-algebra containing all information pertaining to 
particles in generations up to (and including) n. With this notation at hand, we obtain 
for the bound 

p 



E(iVn''<E^(K,(7V,"|F"-i)) 

g=l 

= E^(^K,(iV5,,|F"-i)) 



p/q 



g=l ee<t 

< i2^{j2 n\Ke,t\'' I F"-^))'^' , (6.4) 

q=l si<t 

where we used (6.2) in the first step and the independence of the offspring in the second 
step. In the above expression, £ takes only integer values. Note now that 

y , ^ < C , (6.5) 

uniformly over all f > £. As a consequence, for any positive sequence a„ and any power 
r > 1, one has the bound 



^^ri 

sl<t 

where we have set = s~^^tit + e — el) ae in the intermediate steps. Applying this 
inequality to (6.4), we obtain 



p 



P — Q 



mrr < E E i f--))^^^ . (6.6) 

q=lel<t £ " 

Denote now by M"'^ the number of particles in the nth generation created at time s 
by the jth particle from the n — 1st generation. We write Gg for the a-algebra generated 
by this additional data. Each of these particles yields a contribution to N^^. of either 
1 or 0, depending whether it survives or not. Furthermore, these contributions, which 
we will denote by 5^'/'*, are all independent and, for the same value of j, they are also 
identically distributed. By definition, we thus have the identity 

E E^sr- 

j=i i=i 
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We now twice make use of the inequality 

(E«.)'<"^'"'E«I. (6.7) 

which is valid for any > 1, m > and sequence of positive numbers aj. This yields 
the bound 

{N-,^,f < {N--x^<^-' E iKi'r' E ■ (6-8) 
j=i i=i 

Note that since S^f^ can only take the values or 1, raising it to the power q makes no 
difference. By the "gambler's ruin theorem" [LLIO, Thm 5.1.7], we have the bound 



E(5,",f I F"-i V Ge,) < CV~e^j^^L= , (6.9) 



where ^/e^^ denotes the step performed by the jth particle of the (n — l)st generation 
between times eii — 1) and e£. Regarding the nvmiber of offspring M'^f , it follows 
from the definition of the algorithm that its distribution is given by I(exp(av^^^) — 1), 
where denotes the positive part of . Combining this with (6.8), Lemma 6.3, and 
(6.9), we thus obtain the bound 

E(iiv3,*i'iF"-^)<iA^(ri)er' E vt+e-ei — ^^'+^^- 

In order to simplify this expression, we use the fact that, for a; > 0, there exists a 
constant C depending on q such that 

- 1 < xe^ , (e^ - 1)« < Cxe«^ , 

for every q>l. This yields 

Using (6.7) once again, we get the bound 



^.=1 (t + e-£f)55 

At this stage, we note that, conditional on the state of the system at time e{(. — 1) the 
steps are all independent and identically distributed with law u. Setting 



Jo 



it follows that 



(t + e - e£)2i 
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Inserting this into (6.6) yields for t < 1 the bound 

On the other hand, since we assumed that the initial particle is immortal, iV"~^ is 
stochastically increasing as a function of s, so that we obtain from (6.5) the bound 

Since, by the exponential integrability assumption on i^, there exists C > such that, 
for £ small enough, Pe < C and since = 1, we conclude that there exists some \p 
such that, for all n > 1, one has the bound 

n^^f < (Api)"/' . (6.10) 

It follows that there exists > (depending on p) such that E(A^()^' < 2 (say), for 
every t < ti,. To show that E(7Vt)^' < oo for every t, we use again (6.7), combined with 
the Markov property of the process to conclude that E(Nt+t^ T ^ 2E( A^t)^, from which 
the first claim follows. 

To get (6.3), observe that if we choose g small enough so that Ai^ < 1/4, the bound 
with t < Q follows from (6.10). Denote now by J^t the filtration generated by all events 
up to time t. For t and g that are multiples of e it follows from the Markov property that 

n 

E(iVr|J-t-,)<^7V,%E7V^-^ 
e=o 

where the expectation on the right is taken with respect to an initial condition with one 
single immortal particle. The claimed bovmd for arbitrary t > therefore follows by 
induction. 

It remains to obtain the bound on R] . Denote by the number of offspring 
contributing to R] that are created at time s < t, so that R] = J2Ek<t ^ke *• Denote 
now by ^ the probabihty that, after time t, the random walk (6.1) with initial condition 
^/sz has never exited the interval [0, 6]. It then follows that 

/•oo 

E{Rl^\Tks) = aV^Nke C^^Ql^.^^dyHdz). (6.11) 

JO Jo 

This is because, by Step 3 of Algorithm 2, the expected mmiber of offspring created 
by a particle performing an upward step of size y/ez is given by e°v^^ - 1 while, if 
we denote by y^y the distance between the starting point of the offspring and the 
"wall" below which it is killed, the law of y is given by c^^ ga^/e^/ (j^y where c is a 
normalizing constant. Since c = (e"^^ — l)/ay/e, (6.11) follows. 

Using again the gambler's ruin theorem, combined with the Markov property of the 
random walk, we obtain for ^ the bound 

where Q] is the probability that the random walk (6.1), starting at the origin, stays 
within [—7,7] up to time t. Using the scaling properties of Brownian motion, combined 
with [Fra73], we conclude that one has the bound 

< 1 A — , 
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for every q > 0, so that 

Inserting this bound into (6.11), using the previously obtained bounds on N^^, and 
summing over all k, the claim follows at once. □ 

Remark 6.4 It follows from Theorem 4. 1 that, in the particular case when the initial tag 
V is distributed according to the logarithm of a uniformly distributed random variable, 
one has the identity EA^^ = Ee°^*, where yt is the rescaled random walk with steps 
v. The (at least one-sided) exponential integrability assumption on v is therefore a 
necessary assumption in order to obtain any kind of moment bounds on Nf. 

Remark 6.5 Even if we assume that has Gaussian tails and despite the result previ- 
ously obtained in Theorem 5.13, it is not true in general that has uniform exponential 
moments as e — )• 0. This is because even for the first step, the probability that the 
original particle performs a step of order e^^ is of order exp(— e^^^^^). If this were to 
happen, the number of offspring created in this way would be of order exp(£~^), which 
immediately shows that exponential moments blow up as e ^ 0. 

6.1.3 Compactness at any iixed time 

We now show that, for any fixed time t G [0, 1], we can find a compact set Kg such 
that the process iif belongs to Kg with high probability, uniformly over e. Our first 
ingredient for this is the following moment bound: 

Proposition 6.6 Consider the setting of Theorem 6.1. Then, there exist constants C 
and £o such that, for every t G [0, 1] and every e < eq, the bound 

Exo k - xol^^ntidx, dv, dk)^ < Ct^ , (6.12) 

holds uniformly over all initial conditions Xq = {xq, vq, 0) e A4. 

Proof. We restrict ourselves to the case when t is an integer multiple of s, since 
the bound on the remaining times easily follows from our interpolation procedure. 
Furthermore, we can restrict ourselves to the case when the initial particle is immortal, 
which formally corresponds to setting v = +oo. By translation invariance, we also 
restrict ourselves to the case where the initial particle is located at the origin, and we 
denote the corresponding expectation by Eq. 

It follows from Theorem 4.1 that if we choose v = ^ log u, where u is drawn 
uniformly from [0, 1] and denote by /if the corresponding process, then one has the 
identity 

Eo(^J^x^Pilt(dx,dv,dk)) =Eo(e"^'(yt)'^) , 

where yt is the simple random walk (6.1) started at the origin. It follows immediately 
from the exponential integrability of i/ that this quantity is bounded by Ct'^ for f < 1 
and for e small enough. 

On the other hand, one can realise the process /uf , which starts with an immortal 
initial particle, in the following way: 
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1. Consider the process /if, where the w-component is as above. 

2. When the initial (generation 0) particle is kiUed, replace it instantly by an immortal 
particle starting at the current location. 

Let denote the trajectory of the initial particle and let s be the time at which the initial 
particle is killed and replaced. Let Pt = P(s < t). By the construction just outhned, we 
have the recursion relation 

Ft,o = Et + PtE{Ft_,^^o\s<t), 

where we set 

^t,x° = Eo 
Et = Eq 

(Remember that the difference between Et and Ft.o is that in order to compute F, we 
start with an immortal particle.) Setting Ft = Ftfi, using the fact that, for every 6 > 
one can find C5 such that {x + a;^)^^ < Cs{xifP + (1 + S)x'^^, and recalling that 
Et < CtP, we deduce that 

Ft < CtP + CsPtE{{xyPE^o {Nt-s) I s < i) + (1 + ^)Pt ^{Ft-s \s<t), 

where Nt is the number of particles alive at time t for the system started with an immortal 
particle. Note now that, for t < 1, we know from Proposition 6.2 that the expected 
number of particles alive at any given time is bounded by some constant uniform in s. 
Since this bound is also uniform in the initial condition, we have 

PtE{ix°fPE,o{Nt-s \s<t))< PtE{ix°fP \s<t)= E{ixyn,<t) • 

Defining s = s At,we then obtain the trivial bound 

E{ixyn,<t) < Eo|2/s|'^ < Eolvtf" < , 

where we made use of the fact that \yt\'^^ is a submartingale to obtain the second 
inequality. 

Setting now Ft = supg<j Eg, we can combine all of these bounds to get the inequal- 
ity 

Ft < CstP + (1 + 6)PtFt . 

Since one can check (for example by again using the fact that the random walk approx- 
imates a Brownian motion for small e) that sup^^^ supj<^ Pt = supg<^ Pi < 1, the 
claim follows at once by choosing 5 sufficiently small. □ 

This result can now be used to deduce the announced uniform tightness result over 
fixed times: 

Proposition 6.7 Consider the setting of Theorem 6.1. Then, for every 6 > there exists 
a compact set Ks C X such that Pxo(Mt € Ks) > 1 — ^, uniformly over t e [0, 1], 
£ < £0 and Xq e M^. 



(^j {x + xlf^nlidx, dv, dk)j , 



x^^jl'lidx,dv,dk)) . 



M 
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Proof. For any n e Z+ and R e R+, denote 

A^W = {{x,v,t) &M : t<n} , Br = {{x,v,t) e : |a;| V \v\ < R} . 

For any such n and R and for m G Z_|_, we then denote by Kn^m,R C X the set of 
all integer- valued measures 77 on such that r}{^4 \^4^^^) = 0, rj{M.) < m, and 
77(A^ \ Bji) = 0. Since these sets are obviously compact, it suffices to find, for every 
5 > 0, sufficiently large values n, R and m so that Pxaipf G Kn.mM) > I — S 
uniformly over all e < eo and t E [0, 1]. 

Note that Kn,m,R C K^nK^nK^nK^ where, for and we only enforce 
the conditions involving n and m respectively. The set Kf^ consists of configurations 
such that the x-coordinate of every particle is less than R in absolute value, while K'^ 
enforces that the ?;-coordinate be less than R. 

It follows immediately from (6.3) that there exists 7 > and a constant C (depend- 
ing in principle on the time we consider, but it can be chosen uniformly over t G [0, 1]), 
such that 

for every n > 0. Similarly, it follows irom the moment boimds on obtained in 
Proposition 6.2 that, for every p> 0, there exists C such that 

In order to get a bound on the x-coordinate of the particles, we combine Proposition 6.6 
with Chebyshev's inequality, so that 

It remains to obtain a bound on the probability of not being in K^^. For this, we use the 

fact that on the one hand, the label of a particle always satisfies v > —ax. On the other 
hand, any descendant of the initial particle will always satisfy v < —a sup^^j^ a;°, where 
here we denote by the position of the original particle at time t. Since this particle 
was assumed to start at the origin, we obtain v'^ < o?x^ -\- sup«i(a;°)^, so that we 
obtain the bound 

just as above. Combining all of these bounds, the claim follows by choosing n, R and 
m large enough. □ 

6.1.4 Kolmogorov criterion 

The aim of this section is to obtain the following boimd on the time regularity of our 
process: 

Proposition 6.8 For every p < 1 and g > 1, there exists a constant C such that 

^xAtil-6xAl<C6^'''\ (6.13) 

where Xq & M.^ is an initial condition with only one particle in the system, and 5 <1 
is an integer multiple ofe. 
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Remark 6.9 As usual, the precise location of the particle is irrelevant by translation 
invariance, so the above bound is uniform over all choices of Xq. 

Before we proceed to the proof of Proposition 6.8, we observe that the bound (6.13) 
implies that Kolmogorov's criterion holds for the process over a fixed interval of time: 

Corollary 6.10 For every p < 1 and q> 1, one has the bound 

^x,M+s-^^l\\l<C5^''^ , (6.14) 
uniformly over all d,t,e G [0, 1]. 

Proof. Note first that we can restrict ourselves to the case when t and S are integer 
multiples of s. Indeed, it follows from the definition of || • ||p and from (5.37) that, if 

ke < s < t < {k + l)e, then 

llA^J-Mfll <e-''|t-sniA^L-Ma+i)ell • 
We then obtain from Proposition 6.8 the bound 

j=l 

where we made use of the Markov property, (6.7), and (6.14). □ 

Proof of Proposition 6.8. Denote by Xt the location at time t of a single particle starting 
at Xo = (xq, vq, 0) and evolving under the rescaled random walk stopped when it 
reaches the boundary of M. Denote by Xt the position in R corresponding to Xf. From 
the properties of the random walk and the definition of || • \\p, for every q > I there 
exists a constant C such that the bound 

ExA5x,-5xAl<C5'i^'\ 

holds, independently of the initial condition and independently of e < 1. 

Let us now bound the contribution from the descendants of the initial particle. 
Ordering the particles ahve at time t in such a way that the original particle has label 1 
(if it is not alive anymore, we consider it as being located on the boundary, where it was 
stopped), we have the bound 

Ns 

\\6x,-l.il\\j><Y,'ip{Xf,dM), 

3=2 

SO that 

Ns 

Exo - mi < Exo dpiX^\ 9M)y 

3=2 

/ Ns 

< ( (Ex.iNs - If-') (Ex,J^dpiXf,dM) 
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/ \ 1/2 

<c(Ex,^dl%X^\dM)) , 

where the second inequaUty follows from Proposition 6.2. Recall now that if X^-'' = 
(x, V, n), then one has dp{X^\ dM) = \x — v/a\^, where v/a is guaranteed to take 
values between infs<5 Xg and x. As a consequence, we have the bound 

\x — v/a\^ < |a; — inf Xs\^ <\x — xq\'' + sup \xs — xo\^ , 

SO that 

Ex, df(Xf, dM) < Ex, (Ns sup \xs - a^ol'^") 

+ Ex, J\x- xofP^t^'sidx, dv, dk) < S"" , 

where the last bound is a consequence of Propositions 6.2 and 6.6, as well as standard 
bounds on the supremum of a random walk. The claim now follows at once. □ 

6.2 Convergence of fixed-time distributions 

In this section, we show that any limiting process obtained from the tightness result 
of the previous section necessarily coincides with the Brownian fan constructed in 
Section 5.3.2. With the notations of that section at hand, our convergence result can be 
formulated as follows. 



Theorem 6.11 Consider the setting of Theorem 6.1 with an initial condition Xq = 
(a;, V, 0) e M., and consider as above with the "initial condition" for given by 
a Brownian motion starting at x, killed when it reaches the level — w/a. 

Then, for every p G (0, 1], there exists a version of the process {fit}t>o which is a 
continuous Markov process with values in i^iAd). Furthermore, denoting the law of 
its restriction to the time interval [0, 1] by C, the sequence of measures converges 
weakly to C in C([0, 1]J:p(M)). 

Proof. It suffices to show that, for any fixed collection of times {ti, . . . , tk], the law of 
{/Ltf . }i<k converges weakly to that of {/x* . }i<k- Indeed, Corollary 6.10 then impUes that 
the process fit satisfies Kolmogorov's continuity criterion and therefore has a continuous 
version. By Theorem 6.1, we deduce weak convergence in C([0, Yl^PiM)) from the 
convergence of marginals. Using the Markov property, the superposition property of 
the process, and Proposition 6.2, we reduce ourselves to the case k = l with an initial 
condition consisting of one single particle. 

Denote now by the random integer-valued measure on £ obtained by running 
Algorithm 2 until time 1. Observe that can be built in the following way. For an 

exciffsion w € £, we build a random measure Q^{w) by the following procedure. For 
every e N with ek > s(w) and eik + 1) < e(w;) A 1 we set 

AWk = W(k+l)e - Wks ■ 

If Awk > 0, we then draw a random variable with law X{exp{aAwk) — 1) (see 
(2.9) and Lemma 6.3). For j = 1, . . . , 7V^, we build i.i.d. excursions w'^'^ e f by the 
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following procedure. First, draw a uniform random variable u ^ U{exp{—aAwk), 1) 
and set v = logu — awke- Then, denote by {y^/ j^^o instance of the random 
walk (6.1), started at W(k+i)E and stopped just before it becomes smaller than —v/a 
(so that t/^'^ > —v/a). The excursion w'^'^ is then given by s(w'^'^) — ke, t{w^'^) = 
(L + k + 2)e, = for£ G {0, . . . ,L}, w^'^{le) = —w/a for the remaining 

integer values of and linear interpolation in between integer values. We then set 

oo Nl 

fc=i j=i 

which is the point measure describing the children of the particle with trajectory w. 
Similarly to before, we build recursively by the following procedure: 

• Build an excursion S f as above, starting at x and stopped at —v/a, where 
(a;, v,Q) & M. is the initial condition appearing in the statement. Set = 5^0 . 

• Given fj,^'^, define by 

where the {Q^(w)}«,e£ are all independent (and independent of the /U^'^ with 
i' < i). Note that the integral is actually a finite sum, so the construction makes 
sense. 

• Set /i^'t^^ = J2e'=o A**^'^ for positive £ (including the case £ = oo). 
If we set 

where £"/ is defined as in Section 5.3.2, then the process /uf is indeed equal in law to the 
process considered in Section 6.1 ' . 

Denote now by f -y the set of excursions of height at least 7, namely 

£^ = {w € £ : 3t e l(w) with wi > w(s{w)) + 7} • 

We also write /x^''"^ for the measure obtained exactly like /i'^''^"', but where we replace 
Q^(w) by its restriction Q^(w) to the set at every step, so that /^^^'^"^ < /jf-^^ almost 
surely. In words, QJ^ is obtained from Q*^ by discarding all excursions of height less 
than 7, as well as the descendants of any such excursions. 

Combining Propositions 6.2 and 6.6, we see that, for eq small enough, there exist 
constants C and a > such that one has the boimd 

sup P{E* ^ -BtV^'^"') < C(e-"" + 7") , (6.15) 

e<eo 

uniformly over e and t E [0,1]. 

Following an argument along the lines of the proof of Theorem 5. 13, a similar bound 
can be shown to hold for P{Effj,^°°^ ^ E^fj,]!!/^), where /ij^"^ is the recursive Poisson 
process of depth n constructed like /U^°°^, but with Qiw, •) replaced by its restriction 

' Strictly speaking, the two processes agree only at times that are multiples of e since the two interpolation 
procedures we used may differ when the trajectories of two particles from the same generation cross each 
other. This is irrelevant. 
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•) to As a consequence, we conclude that it is sufficient to show that, for 
every fixed 7 > and n > 1, 

lim 2?(M"'f">) = , (6.16) 

where !?(•) denotes the law of a random variable and convergence is to be understood in 
the sense of weak convergence on the space M+(£) endowed with the Wasserstein-1 
distance (see [Vil03] and Section 6.2.2 below). 

Our aim then is to make use of Theorem 6.17 below which gives a general conver- 
gence result for recursive Poisson point processes. The drawback is that /u^''"' is itself 
not a recursive Poisson point process, due to the fact that Q^(w) is not a realisation of a 
Poisson point process. However, it would be one if, in the construction of Q'^(w), we 
had drawn Nj^ according to a Poisson distribution with mean exp{aAwk) — 1. Denote 
by (Tiw) the Poisson point process obtained in this way, by its restriction to 
and by '"^ the recursive Poisson point process of depth n obtained by iterating Q^. 
We then claim that it is possible to find a coupling between /i^'^"^ and /U^''"^ such that 

P(/i^'M^/x;'M)<C„,^x/i, (6.17) 

where the constant Cn,j depends on n and 7, but not on e. Indeed, if we denote by Ux a 
random variable with law 1(A) and by C/^ a Poisson random variable with mean A, then 
it is straightforward to check that there exists a constant C and a coupling between Ux 
and Ux such that 

n\Ux - Ux\ = 1) < C(l A A^) , P(\Ux - Ux\ > 1) < C(l A A^) . 

Furthermore, by Proposition 5.1, the probability that the random walk started at x 
reaches 7 before becoming negative is bounded from above by C^{x + y/e) for some 
constant depending on 7. As a consequence, one can construct a coupling between 
Q^(w) and Q^iw) such that 

nQ'^iw) ^ Q'^iw)) < \Awk\^i\Awk\ + V^) . (6.18) 

fc£e[0,i] 

Similarly, regarding the total mass of Q^, one has the bound 

EQf^iw,£)=EQ%w,£^)<C^ ^ |Au;fc|^e^l^""=l , (6.19) 

fe£e[0,i] 

for some constant c. If w is an excursion created by the procedure above, the expected 
values of (6.18) and (6.19) are bounded by C^^/e and C-y respectively, for a possibly 
different constant depending on 7. Proceeding as in Lemma 5.6, it follows that, if we set 

fcee[0,i] fe£e[0,i] 

we obtain the bound E Jg Fg{w)iJ,f^'^"'\d'w) < C„^^, uniformly over s. (But this constant 
might potentially grow very fast as 7 ^ and n — )■ 00!) Combining this with (6.18), 
the bound (6.17) then follows at once. 

As a consequence, it is sufficient to show, instead of (6.16), that 

hm P(/if/"l) = D(/iW) . (6.20) 
This is the content of Proposition 6.20 below, which completes the proof. □ 
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The remainder of this section is devoted to the proof of (6.20). The outline of the 
proof goes as follows. First, in Section 6.2.1, we show that the law of a single excursion 
of the random walk (6.1), conditioned on hitting a prescribed level 7, converges as 
e ^ to the Brownian excursion, conditioned to reach 7. In Section 6.2.2 we then 
provide a general criterion for the convergence of recursive Poisson point processes. 
Finally, in Section 6.2.3 we combine these results in order to obtain (6.20). 

6.2.1 Convergence of excursion measures 

As before, we denote by yt the rescaled random walk given by 



where the are an i.i.d. sequence of random variables with law u. As before, we 
extend this to arbitrary times by Unear interpolation. We also assume that v has some 
exponential moment as before. The aim of this section is to show that if we start yt 
at some initial condition x ^ -^e and condition it on reaching a prescribed height 7 
before becoming negative, then its law converges to that of an unnormaUsed Brownian 
excursion, conditioned to reach height 7 (call it w'^). Throughout this whole section, we 
will only consider the process on a fixed time interval, which for definiteness we choose 
to be equal to [0, 1]. 

A more precise description of the law Q-,, of w'^ is given by the identity 



where Q^,^ denotes the law of a Brownian excursion of length s, conditioned to reach 
level 7, and 

g^{s) = Qs,o({w : supj<^ wt > 7}) = Q({w : &Wt<i > 7/v^}) . 

where Q is the standard Ito excursion measure. Since g^is) — J- exponentially fast for 
small s, this does indeed define a probability measure on C(R+, R). We then turn this 
into a probability measure on C([0, 1], R) by restriction. 

We view C([0, 1], R) as a subset of £ via the injection l : C([0, 1],R) ^ £ given by 



and by setting the path component of iw equal to w, stopped when it reaches the time 
c(iAv). We endow £ as before with the distance d given in (5.24). Since we only deal 
with excursions starting at and stopped before time 1, the distance d is equivalent on 
this set to the (pseudo-)distance d given by 



Regarding the space C([0, IJ, R), we endow it throughout this section with the metric 



y(k+i)e = Vks + Vs^k+i , 



/o°"s ^/^ff^(s)Q^,-y( ■ ) rfs 
s-3/2.g^(s) ds 



s(tw) = , 



t(iw) = 1 A mf{t > : w(t) = 0} , 




diw, w') = 1 A sup \wt — . 
te[0,i] 



We furthermore denote by || • ||d the Wasserstein-1 metric associated with any distance 
function d, which is just the dual of the corresponding Lipschitz (semi-)norm. The main 
theorem of this section is given by the following: 
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Theorem 6.12 Let x > and denote by Q| ^ the law of the random walk (6.1), starting 
at z = ^x and conditioned to reach level 7 before becoming negative. Then, for every 
P < jQ there exists a constant C such that 



uniformly over all e e (0, 1], all a; e [0, e ^/^], and all 7 e \e^l^^ , 1]. 

Our main abstract ingredient in the proof is the following criterion for the conver- 
gence of conditional probabilities when the probabiUty of the set on which the measures 
are conditioned converges to 0: 

Lemma 6.13 Let /i and tt be two probability measures on some Polish space y with 
metric d and diameter 1 and let : 3^ ^ [0, 1] and : 3^ — > [0, 1]. For g > 0, set 

Ag = {xey -.Byey with V^{y) > V^{x) + d{x, y)/Q} , 
Ag^{x&y : d(x, Ag) < g} . 

Assume that S, Si and £2 are such that 

/ 'D^ix)widx) >S, Wfj,- n\\d. < e\ , sup \Dt,(x) - ©^(a;)] < £2 , 

Jy X 

where \\ ■ \\d denotes the Wasserstein-1 distance with respect to d. 
Define measures fl and tt by 

KA) = c^ I 'Di^ix)n{dx) , niA) = c„ / 'D^(x)'K{dx) , 
J A J A 

where and are such that these are probability measures. 
Then, the bound 

U - < ^ + £2 + 2^(4)) , 



(6.21) 



holds for every g < 1. In particular, one has jy Vfj^{x)fi{dx) > whenever the right 
hand side in (6.21) is strictly smaller than 1, so that the bound is non-trivial. 

Proof. Let / : 3^ R be a test function such that Lip^(/) < 1. Since the diameter of 
y is assumed to be 1, we can assume without loss of generality (by possibly adding a 
constant to /) that sup^. |/(a;)| < |. Since one has the identity 



IIA-Trlld = sup (ci^ f(x)Vi^(x)iJ,idx)-CT, f(x)V^(x)n(dx))= sup If, 

our aim is to bound I/, uniformly over /. Note first that, by the bound on /, 

+ V / |2^M(a;)-2^7r(a;)|Mrfa;) 
fix)T>^ix) nidx) - / fix)T>^ix)Tridx) . 



^ - 2 



1 1 
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Note first that the first term is nothing but a particular instance of the last term with 
/ = |. Since the second term is fiirthermore trivially bounded by £2/(27), it suffices to 
bound the last term. 

The problem in bounding this term arises of course from the fact that 2?^ is not 
Lipschitz continuous. For any ^? > 0, we can however "mollify" this function by setting 



d(x, y) 



y(zy\ Q 

It is then straightforward to check that Lip^(2?|) < that 2?^(.r) < V^{x?) < 
supy V^^iy), and that furthermore T>^{x) = V^^ix) for all x ^ Ag. It then follows from 
the definition of ei that 

j f(x)Ve^(x)ii(dx)- J f(x)V^^(x)TT(dx)\<eiil + (2g)-') . 

Furthermore, 



f{x)V^^{x)n{dx)- j f(x)VM)n(dx) 

and similarly for the term with tt replaced by fi. In order to bound iJ,{Ag), we set as 
above 

F^(.) = sup(ivy)-^), 

so that 

< ^ F<^(x)]i{dx) J F%x)-K{dx) < ^ +7r(^e) , 

where we used the fact that vanishes outside of Ag. Collecting all of these bounds, 
the claim follows. □ 

An alternative description of is given by the following. Let F be a Bessel-3 
process starting at the origin and let r-y be its first hitting time of 7, i.e. = inf {t > : 

> 7}- Let furthermore i? be a Brownian motion independent of Y, which is stopped 
when it reaches the level 7. Then, one has the decomposition 

-y f Yt fort<r^, 
Wt ={ D f -.^ (6.22) 
* \ 7 - Bt-r-, fort > T,^. 

This is a consequence of the symmetry of the Brownian excursion under time reversal, 
combined with [RW94, Theorem 49.1] for example. 

We can use the above decomposition to obtain the following boimd: 

Lemma 6.14 For any (5 < \, there exists a constant C such that, for every G (0,1], 
one has the bound 

||Q^-Qy|U<C|7-7?- 



Proof. The decomposition (6.22) suggests a natural coupling between and ws> by 
building them from the same basic building blocks Y and B. The characterisation of 
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the Bessel-3 process as the norm of a 3-dimensional Brownian motion, together with 
standard hitting estimates for Brownian motion, imply that 



P(|r^-Ty| >C)< lA 



c\i-i\ 

^3/2 



so that in particular P(|T-y — r-y' I > -^/py-^T^) < C| 7 — 7']^/^. The result now follows 
from the fact that both B and Y are almost surely a-H6lder continuous for every a <\. 

□ 



Lemma 6.15 Let BJ be a Brownian motion starting at z, conditioned to hit 7 before 
0, and stopped upon its return to 0. Then, for every (3 < 1, there exists a constant C 
depending on j3 such that the bound 

\\V{B2)-Q^\U<C^e^ , 

holds uniformly over e e (0, 7 A 1] and 7 > 0. 

Proof Let w'^ be as above and let Ts be the first passage time of w"< through e. Then, it 
follows from the decomposition (6.22) that one has the exact identity 

B2{.) '= w-'i- - Te) . (6.23) 

It then follows from the small ball estimates of Brownian motion that, for every C < 2, 
one has the bound 

P(Te >e^)<e. 

The desired estimate then follows at once from the Holder regularity ofw'*. □ 

Proposition 6.16 Let a G (0, |). Suppose further that 7 > is fixed and denote by 
yl the random walk yt conditioned to hit [7, 00) before hitting R_ and stopped when 
it then hits R_. Assume that = e". Then the law ofy'^ converges weakly to Q-y as 
£ — )• 0. Furthermore, for every /3 > 0, there exists a constant C such that the bound 

II W) - Q7IU < C(^/7£^-" + £""0 . 
holds uniformly over alls < 1 and 76 [e", 1]. 

Proof By Lemma 6.15, it suffices to compare the law of y'^ with that of B^c , which is 
itself going to be a consequence of Lemma 6. 13. 

To see this, we partition the state space y = {w € C([0, 1], R) : = £"} into 
three sets in the following way. Given a continuous function w with wq S (0, 7), we set 
r = 1 A inf{t > : ^ [0, 7]}, and we define sets with i e {1, 2, 3} by 




^(1) = {w : Wt=Q} , A^"^^ = {w : Wr = l} , A*^) = [w : t = 1} 
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Define furthermore functions Fry and on y by 

if w e A^^\ 

where ^ is defined as in (5.13). With these definitions at hand, if we set /i = 'D{y) 
with yo = e", TT = V^B^c), = F^, and T>„ = Fj, then we are precisely in the 
setting of Lemma 6.13 with /t = 'D{y'^) and tt = 'D{B]c.). 

Note first that, since Fj is precisely the probability that a Brownian motion started 
from wi hits 7 before 0, we have 



e 

(dw) = — 
7 



Furthermore, it follows from Corollary 5.2 (and from the fact that F^ and F^ coincide 
outside of that 

1 

si = sup \F.yiw) - F^iw)\ < — . 

w 7 

(We could have replaced \ by any exponent less than \ here.) Regarding the distance 
between the unconditioned measures, we obtain from [Fra73] the bound 

£2 = \\^l-^T\\d < £5 . 

It thus remains to obtain a bound on Ag. By the definition of and of F^, w & A^ 
impUes that either w e A^'^^ U A*^' and d{w, A^"^^) < goTw € A^^'> and d{w, A^^^) < g. 
This implies that 

Ag C \ w : sup U't € [j — g,^] \ U \ w : inf wt <E [-g. 0| > , 
L te[o,i] *6[o,i] J 

so that 

AgClw: sup Wt el'j -2g,^ + g]\LI Iw : inf Wt £ [-2g, g]\ . 
I te[o,i] *e[o,i] J 

Since (by the reflection principle) the law of the extremum of Brownian motion over a 
finite time interval has a smooth density with respect to Lebesgue measure, there exists 
a constant C independent of e and S such that iriAg) < Cg. 

Inserting these bounds into Lenmia 6.13, we thus obtain the bound 

\\Viy^)-ViB].)\U<^(^+ei+g). 

Setting g = e^-y'i completes the proof. □ 

We now have all the ingredients required for the proof of Theorem 6.12. 

Proof of Theorem 6.12. Assume as in previous proof that yj is the random walk yt 
conditioned to hit [7, 00) before hitting R_ and stopped when it then hits R_ . In 
contrast to the above setup we now assiune that y^ = z^/e for some 2: > 0. Let fco be 
given by 

ko = inf{A;> : yl^ > . 
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It then follows from [Fra73], combined with standard small ball estimates for Brownian 
motion that, for every ,8 < § and every p > there exists a constant C such that 

P{sko > < sP , (6.24) 

uniformly over e < 1, for all z such that z^/e < e"^ . Furthermore, the probability that 
yloe ^^"^ (say) is exponentially small in e, again uniformly over x. It then follows 
from Proposition 6.16 (choosing a = j^) that, for every P < one can construct a 
joint realisation of y'^ and w'^ such that 

Ed{y'',w''(--eko)) < . 

(Here we extend w'^ by setting it to for negative times.) On the other hand, the Holder 
regularity of the sample paths of w'' together with (6.24) implies that 

so that the bound on ||Q| , — Q-ylld follows. 

In order to obtain the bound on || i*Ql^^ — i-*Q-y ! | make use of the same coupling 
between y'^ and w'^ as above, so that Ediy'^, w'<') < e^. It then remains to obtain a 
bound on 

|e(i2/^) - z{lw'^)\ . 
For this, note first that, by Chebychev's inequaUty, one has the bound 

nd{y\w'^)>e^)<e^-^ , (6.25) 

valid for every /3 < ^. Consider now any two paths yi' and w'^ at distance less than 
and define 

n = 1 A inf{t > : w^{t) < e^} , r2 = 1 A inf{t > n : w^{t) < -e^} . 

In this way, one has both z{lw'^) G [ti,T2] and e{Ly'^) G [ti, T2], so that it remains to 
obtain a bound on T2 — ri . The explicit expression for the hitting time of a line for a 
Brownian motion yields 

P(T2-ri >s°')<e^-^ , 

for any a < 2(5. Choosing fi ~ ^ and a = ^ and combining this with (6.25), we 
then obtain 

P(|e(t2/^)-e(t«;'>')| >£¥)<£^ , 
from which the bound follows. □ 

6.2.2 Convergence of recursive Poisson point processes 

The aim of this section is to provide a general result allowing us to bound the distance 
between two recursive Poisson point processes of the same depth n in terms of their 
respective kernels. This result is be the main abstract result on which the proof of the 
convergence result (6.20) will be based. One difficulty that we have to overcome is that 
there is very Httle uniformity in the proximity of the kernel describing /x^''^"^ to the one 
describing 
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Throughout this section, given a PoHsh space X with a distance function d bounded 
by 1, we define the Wasserstein-1 distance between any two positive measures with 
finite mass (and not just probability measures!) by 

||/i-7r||i= sup (/ f(x)iJ,(dx)- f(x)'!r(dx)) . 

Lip/<1 ^Jx Jx ' 



11/11 oo<l 



If /u and TT happen to have the same mass, then the additional constraint on the supremum 
norm ||/||oo of / is redundant in the above expression, and we simply recover the usual 
Wasserstein-1 distance. In the case where the masses of yit and tt are different however, 
our choice of definitions ensures that 



|M-7r||i«|/xW-7r(Af)| + |MW| 



\1 TT 



(6.26) 



where « denotes that both quantities are bounded by multiples of each other, with 

proportionality constants that are independent of /i and tt. 
The main result of this section is the following: 

Theorem 6.17 Let Q and Q: X ^ M+(<Y) be two measurable maps and assume that 
the Polish space X is endowed with a metric d bounded by 1. Let A <Z X, e & {Q, 1] 
and K >lbe such that the bounds 

&nvQ{x,X)<K, \\Q{x,X)-Q{y,X)\\i<Kd{x,y), (6.28) 

sup Q(x, A')<e, sup \\Q{x) - Q{x)\\i < e , (6.29) 

xeA xeA 

hold, where we use the notation A'^ = X\A. 

Fix n > 0, X € A and x G X, and denote by /i^"^ and p}-^^ the recursive Poisson 
point processes with respective kernels Q and Q. Then, there exists a coupling between 
pS^^ and /ij"^ such that 

E(l A IImL"^ - Jxt\) < C(^/i + d(x, X)) , 

where the proportionality constant C depends only on K and n. 



Remark 6.18 One useful feature of the way that we set up the bounds in the statement 
is that we only require information about the kernel Q on the set A. In the application 
we have in mind, the kernel Q will be the one describing /i^ '"^, while the set A will 
consist of trajectories exhibiting "typical" behaviour at small scales. 

Before we turn to the proof of this theorem, we show that if 7r„ tt in the 
Wasserstein-1 sense, then the (usual) Poisson point processes with these intensity 
measures also converge to each other weakly in the Wasserstein-1 distance: 

Proposition 6.19 Let tt and tt be two finite positive measures on a Polish space X 
endowed with a metric d < 1 and let /x and p, be the corresponding Poisson point 
processes on X. Then, there exists a constant C and a coupling between p. and p. such 
that 

E(lA||M-/i|li)<C(||7r-7f||iAl). 
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Proof. The proof relies on the fact that, if ViX) denotes the Poisson distribution with 
parameter A, one has the total variation bound 

diw{V{X), V{X)) < I A - A| A 1 , (6.30) 

see for example | AJ06|. 

We can construct /x (and similarly for in the following way. First, draw a Poisson 
random variable N with parameter ■k{X). Then, draw N independent random variables 
{Xi,. . . , Xjv} with law -Kf-KiX) and set 

N 



^i = Y^5x,. (6.31) 



fe=i 

By (6.30), we can now construct a Poisson random variable N with parameter tt{X) 
such that 

P(7V ^ AT) < \it{X) - Tt{X)\ . (6.32) 

Assuming that N = N,we. can then draw random variables {Xi , . . . , Xpf} in such a 
way that the pairs (Xk, X/.) are distributed according to a coupling between n/niX) 
and w/TtiX) that minimises their expected distance. If N ^ N, then we simply draw 
{Xi, . . . , Xn} according to n/Tr(X), independently of the Xk. 
If we then define p, similarly to (6.31), it follows that 



N + N otherwise. 



As a consequence, we obtain the bound 

E(lA||M-/i||i)<P(VV7^iV). 



EN 

1 



niX) TtiX) 

so that the claim follows from (6.32), the definition of N, and (6.26). □ 

Proof of Theorem 6.17. Note first that, by combining the first bound in (6.28) with the 
second bound in (6.29), we obtain the bound 

sup Qix, X)<K + e . 

It follows that we have the recursive bound 

nnliA) I n^r^A^) = 0) < 2KE{jll-\A) I Ji^r\A^) = 0) , 
so that, since jl^iA) = 1 by assumption, 

Eifi^iA) I = 0) < i2Kr . (6.33) 

On the other hand, defining the positive measures tt" and tt " by 

7r"+i = / Q{y)iil{dy) and 77"+^ = / Q{y)i4{dy) 
Jx Jx 
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Pift\A') > 0) < P(mL"" 




<p(Mr 










-^\A'') > 




-^\A') > 



-In-l] 



(A') = 0) 



we have the inequahty 

:mL"-"(^'=) > 0) + Pifl^iA') > 1 mL""^ 
hE(7f"(A'=)|4"-^l(yl' 
^sD + P{fl^-\A) > 

which is vahd unifonnly over all D > 0. Choosing D l/\/e, we thus obtain the 
recursion relation 

Pip)-^\A') > 0) < P(/2L"""(^') > 0) + , 
from which it follows that 

P(/ll"'(A'^) > 0) < C^, 

where in both cases the constant C depends on K and n, but not on e. 
Note now that we have the bound 



(6.34) 



— TT 



n+ll 



< 



X 



Q(y) - limy) 



\Q(y) - Q(y)\\ii22(dy) 



X 



< Will - /2g||i(Lip(Q) + IIQIU) 



+ 



■ / ||Q(2/)-Q(2/)||iMg(rfa;), 

J A" 

so that the bound 

1 A ||7r"+i - 7f"+i||i < impil - /iglli A 1) + ei? + lpn(^)>^ + l^g(^c)>o , 

is valid for every £) > 0. Furthermore, by Chebyshev's inequality and (6.33)-(6.34), 
one has 

P(M^(A) >D)< miiA) > D I Ar '(^') = 0) + P(/Sr '(^') > 0) 
Choosing again D ~ y/e, we thus obtain the bound 



E(l A IItt^ 



n+l 



7f"+ii) < 2KE{\\n: - M^lli A 1) + CVi . 



Applying Proposition 6.19, we conclude that, given /u" and /i^, it is possible to 
construct a coupling between and /l^^^ such that 

E(l A \\f,:+^ - < E(l A \\n: - fi^h) + , 



from whence the claim now follows at once. 



□ 
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6.2.3 Convergence of the truncated distributions 

We are now in a position to provide the proof of (6.20). Again, throughout this section, 
we make the standing assumption that the one- step distribution v for the random walk 
(6.1) has some exponential moment. We also use the notations /i^'^"^ and /ij^"^ from 
(6.20). Let £ be the space of real-valued excursions as before. We then introduce the 
map M+(R2) given by 

¥{w){dz,dt) = ae ^ e"^^5efc(dt)l[o,A|«,](^)(i^ , (6.35) 

Efceio,!] 

where 8^, denotes the Dirac measure located at z, A'^w^ is defined by 

Te ' 

and we used the convention that l[o,z] = if < 0. 

As before, denote by the law of the random walk (6.1), starting at z-^e, and 
conditioned to hit level 7 before becoming negative. We stop it as soon as it hits R_, so 
that we interpret QJ ,^ as a measure on 5o- Recall also that P^^/y^, with P^^^ defined 
as in Section 5.1.2 denotes the probabiUty that this event actually happens. 

With this notation, the measure describing /i^'f"^ (i.e. Q^(w, •) is the intensity 
measure of Q^(ti;)) is given by 

o^r,„ ^ f (3>* n'^ P ^'(w)(dz, dt) 
y (w, •) = / B„ j= 



Note now that if tz; is a typical reaUsation of QJ,^, then J^(w) is expected to be close to 
the measure 

J(w) = li(w)n[0,i]it) dt (g) i>idz) , 
where i> is the measure on R+ given by 



//•OO /^Z 
Giz)uidz)= / / G{y)dyy{dz), 
Jo Jo 



for any test function G. This is because e°^^^™ ~ 1 and the law of A|w would be 
given by u, were it not for the conditioning. 

On the other hand, the kernel Q-y describing the truncated Brownian fan /uj^"^ with 
parameter a is given by 



/ e* ,,Q^ dt=] f Giz) JiwXdz, dt) , (6.36) 

Jl(w)n[OA] " JR2 



where Q-^ is the law of a Brownian excursion conditioned to reach level 7. This is the 
case because of the well-known fact that ^Q-., is the law of the unnormalised Brownian 
excursion restricted to the set of excursions reaching level 7. The second identity in 
(6.36) is a consequence of the definition of p, combined with Proposition 5.3. With 
these notations at hand, we have: 
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Proposition 6.20 Consider the setting and assumptions of Theorem 6.11. For every 
(5 < there exists a constant C depending on 7, n and 6 such that 

E(lA||/i;'M-MM||i)<(7£*, 
uniformly over e < sofor some eq small enough. 

Proof We apply Theorem 6.17 with Q = Q^, Q = Q^, and A to be determined. Using 
the results obtained earlier in this section, it turns out that the assiunptions are then 
relatively straightforward to check. 

For convenience, we introduce the notation 

fHw)(dz, dt) = G{x)fHtv){dz, dt) , 

and similarly for J. We also denote by 112 : ^ R the projection onto the second 
component, so that n2J^(w) (and similarly for J) is the projection of J'^(w) onto the 
f-component. We also denote by fie C f the subset of excursions given by 

Oe = ^^e^ n Of , 

where we set 

f^W ^{w^£ : f^(w){{x > e-^/^}) = 0} , 

nf^ = {we£ : ||n*J(w) - n*J?(w)||i < e^/^°} . 

Also, in the definition of the Wasserstein-1 distance is taken with respect to the 
somewhat unusual distance on R given by 

dit,t') = lA\t-t'\^/^. (6.37) 

The reason for this seemingly strange choice will become clear later. The set fig defined 
in this way will play the role of A when applying Theorem 6.17. 

Note first that Q-yiWjX) < which is bounded independently of w. Furthermore, 
one has the bound 

\\Q.yiw,X)-Q-,iw,X)\\,<^ [ ||e;,,Q^-e*,,,Q^||irfi 

^7 Ji(tu)ni(tD)n[o,i] 

+ ^{\siw) - siw)\ + \tiw) - t{w)\) . 

Since furthermore ||6* ^Q-y — 6^ tQ7l|i < \wt — wt\, it does indeed follow that Q is 
globally Lipschitz continuous as required, so that (6.28) holds with some constant K 
depending on 7. 

It remains to check that (6.29) holds, with e replaced by some power of e. By 
Theorem 6.12, we already know that there exists a constant C, possibly depending on 7, 
such that 

IIQ^.^-QtIIi <C£*. 

for any exponent (5 e (0, j^), and uniformly over all ^; < e~^l^. As a consequence, for 
every w e O^, we have the bound 

||Q^(«;,Af) - Ql^{w,X)\\^ < - f i^P.,^ - G{z)) ¥{w){dz,dt) 
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+ - / \\Q*n,M,J - ^w,tQ-y\\ nwXdz, dt) 

+ - / e* ,tQ^(n*/-(«;)(dt) - n*j(«;)(di)) 

+ C^£^/i°(l + Lipe;,,Q^), 
where Lip 6* jQ^ denotes the quantity 

Lip e^,Q, = sup i^|,_,,|V3 • 

It is at this stage that the choice (6.37) of distance function becomes clear: with respect to 
the usual Euclidean distance, the map t !-> 9* ^Q-^ would not be Lipschitz continuous. 
In this way however, it follows immediately from the Holder continuity of Brownian 
motion that Lip 6* jQ.y < oo. 

Furthermore, it follows from the definition of fi*.^' that ,J^(w)(R^) is bounded by a 
constant independent of w and e. Since G{x) is bounded from below by a constant, this 
implies that the same is true of J^(w)(R^). Combining these bounds, we then obtain 

\\Q.,{w,X)- Ql^{w,X)\\^ < C^e^ , 

for some constant C and any 6 < j^. 

In order to complete the proof, it thus remains to show that inf^go^ Q^iw, f2|) 
sufficiently fast as e — )■ 0, where denotes the complement of il^. As a consequence 
of the definition of ilg, this follows if we can show that QJ^(ri^) — >^ as £ — >^ 0, 
uniformly over all z < e~^^^. Instead of considering QJ^, it is much easier to consider 
Q|, the law of the rescaled random walk (6. 1) over the time interval [0, 1]. 

Observe that QJ is obtained from QJ by first conditioning on the event that 7 is 
reached before the walk becomes negative and then stopping the walk. By Proposi- 
tion 5.1, the probability of this event is bounded from below by C\/i for some constant 
C depending on 7 but independent of z G [0, £~^/^]. Therefore it is sufficient to find a 
set with the following two properties: 

• There exists an exponent C > 5 and a constant C such that QliX \ S^) < Cs'' 
for every a <1 and every z G [0, e~^/'^]. 

• For every w and every t G [0, 1], the path w obtained from w by stopping 
it at time t belongs to fig. 

In order to determine whether a path w belongs to S^, we "coarse-grain" it into pieces 
of length e^/"* (the precise exponent is not very important as we do not endeavour to 
obtain optimal convergence rates) and we impose that the contribution of 3'^{w) on 
each piece is very close to what it should be. In other words, setting Ik C [0, 1] by 
Ik = [e^/^k, £^/^{k + 1)), we define as 



= n« n [w : |n*j'-(M;)(7fc) - 1 14 1 



< 



where denotes the length of the interval Ik- 

We first note that if w e Se and t e [0, 1], then the path w obtained by stopping 

w at time t does belong to fi^. Since J^(w) < J^(w), w G Vl^P implies that w E fl^J-K 
Denote now by 77^ the measure r]^ = 1X2 J'^(w) and by 77 the target measure, namely 



References 



70 



V = t-^|[o.t]> where A denotes the Lebesgue measure. By the assumption on J^(w), it 
follows that one has \r]^ilk) — vilk)] < y/s for each of the intervals Ik, except possibly 
for the interval containing t. As a consequence, denoting by tjI and •qk the restrictions 
of 7]^ and T] to Ik, one obtains for all k such that t ^ Ik the boimd 

||?7fe-?7fe||i <£5 + l +£5 . 

(Recall that we use the distance function (6.37), this is the reason for the exponent i.) 
Summing over k and using the fact that ||?7| — ?7fej|i < Ce^/"^ for the value k such that 
t £ Ik, it follows that one has \\ri^ — ^ Ce^/^, which indeed impUes that it; S il^, 
at least for e small enough. 

It remains to show that Q%(£ \ Se) ^ Ce'' for sufficiently large (. Observe first 
that Ql(£ \ fi^i*) < CeP for every p > thanks to the fact that the distribution u of the 
steps of our random walk has exponential tails. 

To obtain the required bound on Ql{£ \ 5^), we use the fact that H^J^iwXIk) 
consists of the sum of £~^/^ i.i.d. copies of a random variable Y with law given by 



l-z 

Y = as e''^'G(z)dz, V{Z) 
Jo 



where u is the one-step distribution of the underlying random walk. Note now that, as a 
consequence of Proposition 5.3 and the fact that v admits some exponential moment, 
one has 

EF= |e + 0(£3/2), 

for all £ sufficiently small. Similarly, it is straightforward to check that E|y|P = O^e'^) 
for every p > 0. As a consequence, one has for every p > the bound 



E 



a , 



np^iwxik) ~ -\ik\ 



for a constant possibly depending on p. It immediately follows that 

P(\up~-iw)ilk) -^\Ik\\>V^)<CeP, 

for every p > 0. Summing over k and combining this with the previously obtained 
bound on Ql(£ \ fi^^^), we conclude that QliS \ S^) < Ce^ for every p > 0, which 
concludes our proof. □ 
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